CHEMICAL INHIBITORS OF MISMATCH REPAIR

TECHNICAL FIELD OF THE INVENTION 

The invention is related to the area of mutagenesis. In particular, it is related to the
field of blocking specific DNA repair processes.

BACKGROUND OF THE INVENTION 

Mismatch repair (MMR) is a conserved DNA repair process that is involved in the
post-replicative repair of mutated DNA sequences. The process involves a group of gene
products, including the mutS homologs GTBP, hMSH2, and hMSH3 and the mutL homologs hMLH1,
hPMS1, and hPMS2 (Bronner, C.E. et al. (1994) Nature 368:258-261; Papadopoulos, N. et al.
(1994) Science 263:1625-1629; Leach, F.S. et al. (1993) Cell 75:1215-1225; Nicolaides, N.C.
et al. (1994) Nature 371:75-80), that work in concert to correct mispaired mono-, di-, and
tri-nucleotides and point mutations, and to monitor for correct homologous recombination.
Germline mutations in any of the genes involved in this process result in global point
mutations and instability of mono-, di-, and tri-nucleotide repeats (a feature referred to
as microsatellite instability (MI)) throughout the genome of the host cell. In man, genetic
defects in MMR result in the predisposition to hereditary nonpolyposis colon cancer (HNPCC),
a disease in which tumors retain a diploid genome but have widespread MI (Bronner, C.E. et
al. (1994) Nature 368:258-261; Papadopoulos, N. et al. (1994) Science 263:1625-1629; Leach,
F.S. et al. (1993) Cell 75:1215-1225; Nicolaides, N.C. et al. (1994) Nature 371:75-80;
Harfe, B.D., and S. Jinks-Robertson (2000) Annu. Rev. Genet. 34:359-399; Modrich, P. (1994)
Science 266:1959-1960). Though the mutator defect that arises from MMR deficiency can affect
any DNA sequence, microsatellite sequences are particularly sensitive to MMR abnormalities
(Peinado, M.A. et al. (1992) Proc. Natl. Acad. Sci. USA 89:10065-10069). Microsatellite
instability is therefore a useful indicator of defective MMR. In addition to its occurrence
in virtually all tumors arising in HNPCC patients, MI is found in a small fraction of
sporadic tumors with distinctive molecular and phenotypic properties that are due to
defective MMR (Perucho, M. (1996) Biol. Chem. 377:675-684).

MMR deficiency leads to a wide spectrum of mutations (point mutations, insertions,
deletions, recombination, etc.) that can occur throughout the genome of a host cell. This
effect has been found to occur across a diverse array of organisms, ranging from, but not
limited to, unicellular microbes, such as bacteria and yeast, to more complex organisms such
as Drosophila and mammals, including mice and humans (Harfe, B.D., and S. Jinks-Robertson
(2000) Annu. Rev. Genet. 34:359-399; Modrich, P. (1994) Science 266:1959-1960). The ability
to block MMR in a normal host cell or organism can result in the generation of genetically
altered offspring or sibling cells that have desirable output traits for applications such
as, but not limited to, agriculture, pharmaceuticals, chemical manufacturing, and specialty
goods. A chemical method that can block the MMR process is beneficial for generating
genetically altered hosts with commercially valuable output traits. A chemical strategy for
blocking MMR in vivo offers a great advantage over a recombinant approach for producing
genetically altered host organisms. One advantage is that a chemical approach bypasses the
need for introducing foreign DNA into a host, resulting in a rapid approach for inactivating
MMR and generating genetically diverse offspring or sib cells. Moreover, a chemical process
is highly regulated in that once a host organism with a desired output trait is generated,
the chemical is removed from the host and its MMR process is restored, thus fixing the
genetic alteration in subsequent generations. The invention described herein is directed to
the discovery of small molecules that are capable of blocking MMR, thus resulting in host
organisms with MI, a hallmark of MMR deficiency (Peinado, M.A. et al. (1992) Proc. Natl.
Acad. Sci. USA 89:10065-10069; Perucho, M. (1996) Biol. Chem. 377:675-684; Wheeler, J.M. et
al. (2000) J. Med. Genet. 37:588-592; Hoang, J.M. et al. (1997) Cancer Res. 57:300-303).
Moreover, host organisms exhibiting MI are then selected to identify subtypes with new
output traits, such as, but not limited to, mutant nucleic acid molecules, polypeptides,
biochemicals, physical appearance at the microscopic and/or macroscopic level, or phenotypic
alterations in a whole organism. In addition, the ability to develop MMR defective host
cells by a chemical agent provides a valuable method for creating genetically altered cell
hosts for product development. The invention described herein is directed to the creation of
genetically altered cell hosts via the blockade of MMR using chemical agents in vivo.

The advantages of the present invention are further described in the examples and
figures within this document.
SUMMARY OF THE INVENTION

The invention provides methods for rendering cells hypermutable by blocking
MMR activity with chemical agents.

The invention also provides genetically altered cell lines which have mutations
introduced through interruption of mismatch repair.

The invention further provides methods to produce an enhanced rate of genetic
hypermutation in a cell.

The invention encompasses methods of mutating a gene of interest in a cell,
methods of creating cells with new phenotypes, and methods of creating cells with new
phenotypes and a stable genome.

The invention also provides methods of creating genetically altered whole
organisms and methods of creating whole organisms with new phenotypes.

These and other objects of the invention are provided by one or more of the
embodiments described below.

In one embodiment of the invention, a method for screening chemical compounds
that block mismatch repair (MMR) is provided. An MMR-sensitive reporter gene
containing an out-of-frame polynucleotide repeat in its coding region is introduced into an
MMR proficient cell. The cell is grown in the presence of chemicals. Chemicals that alter
the genetic structure of the polynucleotide repeat yield a biologically active reporter gene
product. Chemicals that disrupt the polynucleotide repeat are identified as MMR blocking
agents.
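
By way of illustration only, the reading-frame logic behind such a reporter can be sketched in Python. The sketch assumes a hypothetical construct in which the repeat tract lies between the start codon and the reporter coding region; the sequences, function names, and repeat lengths are illustrative assumptions and are not taken from the vectors described herein.

```python
# Toy model of an MMR-sensitive reporter. All sequences and names are
# hypothetical and serve only to illustrate the reading-frame logic.

START_CODON = "ATG"
REPORTER_CDS = "GGATCCGAAGACTAA"  # placeholder for the reporter coding region


def build_reporter(repeat_unit: str, copies: int) -> str:
    """Assemble start codon + repeat tract + reporter coding region."""
    return START_CODON + repeat_unit * copies + REPORTER_CDS


def reporter_in_frame(repeat_unit: str, copies: int) -> bool:
    """The reporter is read in frame only if the repeat tract adds a whole
    number of codons (a multiple of three nucleotides)."""
    return (len(repeat_unit) * copies) % 3 == 0


def score_clone(repeat_unit: str, start_copies: int, observed_copies: int) -> str:
    """Classify a clone recovered after growth in the presence of a chemical."""
    if observed_copies == start_copies:
        return "repeat unchanged"
    if reporter_in_frame(repeat_unit, observed_copies):
        return "repeat altered, reporter restored"
    return "repeat altered, reporter still out of frame"


if __name__ == "__main__":
    # A 29-copy mononucleotide tract leaves the reporter out of frame;
    # gaining one copy or losing two copies restores the reading frame.
    for copies in (29, 30, 27, 28):
        print(copies, score_clone("A", 29, copies))
```

In this simplified model only those repeat alterations that bring the tract to a multiple of three nucleotides would be scored as yielding an active reporter; other alterations of the repeat would go undetected by reporter activity alone.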

In another embodiment of the invention, an isolated MMR blocking chemical is
provided. The chemical can block MMR of a host cell, yielding a cell that exhibits an
enhanced rate of hypermutation.

In another embodiment of the invention, a method is provided for introducing a
mutation into a gene of interest. A chemical that blocks mismatch repair is added to the
culture of a cell line. The cells become hypermutable as a result of the introduction of the
chemical. The cell further comprises a gene of interest. The cell is cultured and tested to
determine whether the gene of interest harbors a mutation.

In another embodiment of the invention, a method is provided for producing new 
phenotypes of a cell. A chemical that blocks mismatch repair is added to a cell culture. 



WO 02/054856 






The cell becomes hypermutable as a result of the introduction of the chemical. The cell is 
cultured and tested for the expression of new phenotypes. 

In another embodiment of the invention, a method is provided for restoring genetic
stability in a cell in which mismatch repair is blocked via a chemical agent. The chemical
is removed from the cell culture and the cell restores its genetic stability.

In another embodiment of the invention, a method is provided for restoring genetic 
stability in a cell with blocked mismatch repair and a newly selected phenotype. The 
chemical agent is removed from the cell culture and the cell restores its genetic stability 
and the new phenotype is stable. 
In another embodiment of the invention, a chemical method for blocking MMR in
plants is provided. The plant is grown in the presence of a chemical agent. The plant is
grown and exhibits an enhanced rate of hypermutation.

In another embodiment of the invention, a method for screening chemical
inhibitors of MMR in plants in vivo is provided. MMR-sensitive plant expression vectors
are engineered. The reporter vectors are introduced into plant hosts. The plant is grown in
the presence of a chemical agent. The plant is monitored for altered reporter gene
function.

In another embodiment of the invention, a method is provided for introducing a
mutation into a gene of interest in a plant. A chemical that blocks mismatch repair is
added to a plant. The plant becomes hypermutable as a result of the introduction of the
chemical. The plant further comprises a gene of interest. The plant is grown. The plant is
tested to determine whether the gene of interest harbors a mutation.

In another embodiment of the invention, a method is provided for producing new
phenotypes of a plant. A chemical that blocks mismatch repair is added to a plant. The
plant becomes hypermutable as a result of the introduction of the chemical. The plant is
grown and tested for the expression of new phenotypes.

In another embodiment of the invention, a method is provided for restoring genetic
stability in a plant in which mismatch repair is blocked via a chemical agent. The
chemical is removed from the plant culture and the plant restores its genetic stability.

In another embodiment of the invention, a method is provided for restoring genetic
stability in a plant with blocked mismatch repair and a newly selected phenotype. The
chemical agent is removed from the plant culture and the plant restores its genetic stability
and the new phenotype is stable.

These and other embodiments of the invention provide the art with methods that
can generate enhanced mutability in microbes, organisms of the protista class, insect cells,
mammalian cells, plants, and animals, as well as providing cells, plants and animals
harboring potentially useful mutations.

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows diagrams of mismatch repair (MMR) sensitive reporter genes.

Figure 2 shows a screening method for identifying MMR blocking chemicals.

Figure 3 shows identification of a small chemical that blocks MMR and genetically alters
the pCAR-OF vector in vivo.

Figure 4 shows shifting of endogenous microsatellites in human cells induced by a
chemical inhibitor of MMR.

Figure 5 shows sequence analysis of microsatellites with altered repeats from cells
treated with chemical inhibitors of MMR.

Figure 6 shows generation of host organisms with new phenotypes using a chemical
blocker of MMR.

Figure 7 shows a schematic diagram of an MMR-sensitive reporter gene for plants.

Figure 8 shows lead compounds and derivatives thereof that are inhibitors of MMR in
vivo.

DETAILED DESCRIPTION OF THE INVENTION 

Various definitions are provided herein. Most words and terms have the meaning
that would be attributed to those words by one skilled in the art. Words or terms
specifically defined herein have the meaning provided in the context of the present
invention as a whole and as are typically understood by those skilled in the art. Any
conflict between an art-understood definition of a word or term and a definition of the
word or term as specifically taught herein shall be resolved in favor of the latter. Headings
used herein are for convenience and are not to be construed as limiting.

As used herein the term "anthracene" refers to the compound anthracene. However,
when referred to in the general sense, such as "anthracenes," "an anthracene" or "the
anthracene," such terms denote any compound that contains the fused triphenyl core structure
of anthracene, i.e.,

[structural formula of the anthracene core]

regardless of extent of substitution.




In certain preferred embodiments of the invention, the anthracene has the formula:

[structural formula: anthracene core bearing substituents R1 through R10]

wherein R1-R10 are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl,
alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl,
O-alkenyl, S-alkenyl, N-alkenyl, O-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl,
aryloxy, substituted aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl,
alkylaryl, alkylaryloxy, arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl,
guanidino, carboxy, an alcohol, an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an
aldehyde group, an ester, an ether, a crown ether, a ketone, an organosulfur compound, an
organometallic group, a carboxylic acid, an organosilicon or a carbohydrate that optionally
contains one or more alkylated hydroxyl groups;

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted
alkynyl, substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower alkyl,
aryl, heteroaryl, aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy
and amino;

and wherein said amino groups are optionally substituted with an acyl group, or 1 to 3 aryl
or lower alkyl groups;

or wherein any two of R1-R10 can together form a polyether;

or wherein any two of R1-R10 can, together with the intervening carbon atoms of the
anthracene core, form a crown ether.
As used herein, "alkyl" refers to a hydrocarbon containing from 1 to about 20 carbon
atoms. Alkyl groups may be straight, branched, cyclic, or combinations thereof. Alkyl groups
thus include, by way of illustration only, methyl, ethyl, propyl, isopropyl, butyl, isobutyl,
cyclopentyl, cyclopentylmethyl, cyclohexyl, cyclohexylmethyl, and the like. Also included
within the definition of "alkyl" are fused and/or polycyclic aliphatic cyclic ring systems such
as, for example, adamantane. As used herein the term "alkenyl" denotes an alkyl group
having at least one carbon-carbon double bond. As used herein the term "alkynyl" denotes
an alkyl group having at least one carbon-carbon triple bond.

In some preferred embodiments, the alkyl, alkenyl, alkynyl, aryl, aryloxy, and
heteroaryl substituent groups described above may bear one or more further substituent
groups; that is, they may be "substituted". In some preferred embodiments these substituent
groups can include halogens (for example fluorine, chlorine, bromine and iodine), CN, NO2,
lower alkyl groups, aryl groups, heteroaryl groups, aralkyl groups, aralkyloxy groups,
guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino groups. In addition, the
alkyl and aryl portions of aralkyloxy, arylalkyl, arylsulfonyl, alkylsulfonyl, alkoxycarbonyl,
and aryloxycarbonyl groups also can bear such substituent groups. Thus, by way of example
only, substituted alkyl groups include fluoro-, chloro-, bromo- and iodoalkyl groups,
aminoalkyl groups, and hydroxyalkyl groups, such as hydroxymethyl, hydroxyethyl,
hydroxypropyl, hydroxybutyl, and the like. In some preferred embodiments such hydroxyalkyl
groups contain from 1 to about 20 carbons.

As used herein the term "aryl" means a group having 5 to about 20 carbon atoms and
which contains at least one aromatic ring, such as phenyl, biphenyl and naphthyl. Preferred
aryl groups include unsubstituted or substituted phenyl and naphthyl groups. The term
"aryloxy" denotes an aryl group that is bound through an oxygen atom, for example a phenoxy
group.

In general, the prefix "hetero" denotes the presence of at least one hetero (i.e., non-carbon)
atom, which is in some preferred embodiments independently one to three O, N, S,
P, Si or metal atoms. Thus, the term "heteroaryl" denotes an aryl group in which one or more
ring carbon atoms are replaced by such a heteroatom. Preferred heteroaryl groups include
pyridyl, pyrimidyl, pyrrolyl, furyl, thienyl, and imidazolyl groups.

The term "aralkyl" (or "arylalkyl") is intended to denote a group having from 6 to 15
carbons, consisting of an alkyl group that bears an aryl group. Examples of aralkyl groups
include benzyl, phenethyl, benzhydryl and naphthylmethyl groups. 

The term "alkylaryl" (or "alkaryl") is intended to denote a group having from 6 to 15
carbons, consisting of an aryl group that bears an alkyl group. Examples of alkylaryl groups
include methylphenyl, ethylphenyl and methylnaphthyl groups.

The term "arylsulfonyl" denotes an aryl group attached through a sulfonyl group, for
example phenylsulfonyl. The term "alkylsulfonyl" denotes an alkyl group attached through
a sulfonyl group, for example methylsulfonyl.

The term "alkoxycarbonyl" denotes a group of formula -C(=O)-O-R where R is alkyl,
alkenyl, or alkynyl, where the alkyl, alkenyl, or alkynyl portions thereof can be optionally
substituted as described herein.

The term "aryloxycarbonyl" denotes a group of formula -C(=O)-O-R where R is aryl,
where the aryl portion thereof can be optionally substituted as described herein.
The terms "arylalkyloxy" or "aralkyloxy" are equivalent, and denote a group of
formula -O-R'-R", where R' is alkyl, alkenyl, or alkynyl which can be optionally
substituted as described herein, and wherein R" denotes an aryl or substituted aryl group.

The terms "alkylaryloxy" or "alkaryloxy" are equivalent, and denote a group of
formula -O-R'-R", where R' is an aryl or substituted aryl group, and R" is alkyl, alkenyl, or
alkynyl which can be optionally substituted as described herein.

As used herein, the term "aldehyde group" denotes a group that bears a moiety of
formula -C(=O)-H. The term "ketone" denotes a moiety containing a group of formula
-R-C(=O)-R', where R and R' are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl,
aralkyl, or alkaryl, each of which may be substituted as described herein.

As used herein, the term "ester" denotes a moiety having a group of formula
-R-C(=O)-O-R' or -R-O-C(=O)-R', where R and R' are independently alkyl, alkenyl, alkynyl,
aryl, heteroaryl, aralkyl, or alkaryl, each of which may be substituted as described herein.
The term "ether" denotes a moiety having a group of formula -R-O-R', where R and
R' are independently alkyl, alkenyl, alkynyl, aryl, heteroaryl, aralkyl, or alkaryl, each of
which may be substituted as described herein.

The term "crown ether" has its usual meaning of a cyclic ether containing several
oxygen atoms. As used herein the term "organosulfur compound" denotes aliphatic or
aromatic sulfur containing compounds, for example thiols and disulfides. The term
"organometallic group" denotes an organic molecule containing at least one metal atom.

The term "organosilicon compound" denotes aliphatic or aromatic silicon containing
compounds, for example alkyl and aryl silanes.

The term "carboxylic acid" denotes a moiety having a carboxyl group, other than an
amino acid.

As used herein, the term "amino acid" denotes a molecule containing both an amino
group and a carboxyl group. In some preferred embodiments, the amino acids are α-, β-, γ-
or δ-amino acids, including their stereoisomers and racemates. As used herein the term
"L-amino acid" denotes an α-amino acid having the L configuration around the α-carbon, that
is, a carboxylic acid of general formula CH(COOH)(NH2)-(side chain), having the
L-configuration. The term "D-amino acid" similarly denotes a carboxylic acid of general
formula CH(COOH)(NH2)-(side chain), having the D-configuration around the α-carbon. Side
chains of L-amino acids include naturally occurring and non-naturally occurring moieties.
Non-naturally occurring (i.e., unnatural) amino acid side chains are moieties that are used in
place of naturally occurring amino acid side chains in, for example, amino acid analogs. See,
for example, Lehninger, Biochemistry, Second Edition, Worth Publishers, Inc., 1975, pages
72-77, incorporated herein by reference. Amino acid substituents may be attached through
their carbonyl groups through the oxygen or carbonyl carbon thereof, or through their amino
groups, or through functionalities residing on their side chain portions.

As used herein "polynucleotide" refers to a nucleic acid molecule and includes
genomic DNA, cDNA, RNA, mRNA and the like.

As used herein "antisense oligonucleotide" refers to a nucleic acid molecule that is
complementary to at least a portion of a target nucleotide sequence of interest and specifically
hybridizes to the target nucleotide sequence under physiological conditions.

As used herein "inhibitor of mismatch repair" refers to an agent that interferes with at
least one function of the mismatch repair system of a cell and thereby renders the cell more
susceptible to mutation.

As used herein "hypermutable" refers to a state in which a cell in vitro or in vivo is
made more susceptible to mutation through a loss or impairment of the mismatch repair
system.

As used herein "agents," "chemicals," and "inhibitors" when used in connection
with inhibition of MMR refer to chemicals, oligonucleotides, analogs of natural
substrates, and the like that interfere with normal function of MMR.

Methods for developing hypermutable cells and whole organisms have been
discovered by taking advantage of the conserved mismatch repair (MMR) process of a
host. Dominant negative alleles of MMR genes, when introduced into cells or transgenic
animals, increase the rate of spontaneous mutations by reducing the effectiveness of DNA
repair and thereby render the cells or animals hypermutable. Hypermutable microbes,
protozoans, insects, mammalian cells, plants or whole animals can then be utilized to
develop new mutations in a gene of interest. It has been discovered that the use of
chemicals that block MMR, and thereby render cells hypermutable, is an efficient way to
introduce mutations in cells and genes of interest. In addition, destabilizing the genome of
cells by exposure to chemicals that inhibit MMR activity may be done transiently, allowing
cells to become hypermutable, and removing the chemical exposure after the desired effect
(e.g., a mutation in a gene of interest) is achieved. The chemicals that inhibit MMR activity
that are suitable for use in the invention include, but are not limited to, anthracene
derivatives, nonhydrolyzable ATP analogs, ATPase inhibitors, antisense oligonucleotides that
specifically anneal to polynucleotides encoding mismatch repair proteins, DNA
polymerase inhibitors, and exonuclease inhibitors. These chemicals can enhance the rate
of mutation due to inactivation of MMR, yielding clones or subtypes with altered
biochemical properties. Methods for identifying chemical compounds that inhibit MMR
in vivo are also described herein.

The process of MMR, also called mismatch proofreading, is carried out by a group
of protein complexes in cells ranging from bacteria to man (Harfe, B.D., and S.
Jinks-Robertson (2000) Annu. Rev. Genet. 34:359-399; Modrich, P. (1994) Science
266:1959-1960). An MMR gene is a gene that encodes one of the proteins of such a
mismatch repair complex. Although not wanting to be bound by any particular theory of
mechanism of action, an MMR complex is believed to detect distortions of the DNA helix
resulting from non-complementary pairing of nucleotide bases. The non-complementary
base on the newer DNA strand is excised, and the excised base is replaced with the
appropriate base, which is complementary to the older DNA strand. In this way, cells
eliminate many mutations that occur as a result of mistakes in DNA replication.
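
The strand-directed correction described above can be sketched, purely for illustration, as a comparison of the newer strand against the older (template) strand. The sketch is a deliberately simplified model: the two strands are written position by position, strand polarity is ignored, and the function names and sequences are hypothetical.

```python
# Simplified model of mismatch detection and strand-directed correction.
# Strands are aligned position by position; polarity is ignored for clarity.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def find_mismatches(older_strand: str, newer_strand: str) -> list[int]:
    """Return positions where the newer strand is not complementary to the
    older (template) strand."""
    if len(older_strand) != len(newer_strand):
        raise ValueError("strands must have the same length")
    return [
        i
        for i, (old, new) in enumerate(zip(older_strand, newer_strand))
        if COMPLEMENT[old] != new
    ]


def repair(older_strand: str, newer_strand: str) -> str:
    """Replace each mismatched base on the newer strand with the base that
    is complementary to the older strand."""
    repaired = list(newer_strand)
    for i in find_mismatches(older_strand, newer_strand):
        repaired[i] = COMPLEMENT[older_strand[i]]
    return "".join(repaired)


if __name__ == "__main__":
    older = "ATGCCGTA"
    newer = "TACAGCAT"  # position 3 carries a replication error (A instead of G)
    print(find_mismatches(older, newer))
    print(repair(older, newer))
```

In this model, blocking MMR corresponds to skipping the call to `repair`, so that the replication error on the newer strand persists and becomes fixed as a mutation in the next round of replication.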

Dominant negative alleles cause an MMR defective phenotype even in the presence
of a wild-type allele in the same cell. An example of a dominant negative allele of an
MMR gene is the human gene hPMS2-134 (SEQ ID NO:25), which carries a truncating
mutation at codon 134 (Nicolaides, N.C. et al. (1998) Mol. Cell. Biol. 18:1635-1641). The
mutation causes the product of this gene to abnormally terminate at the position of the
134th amino acid, resulting in a shortened polypeptide containing the N-terminal 133
amino acids (SEQ ID NO:24). Such a mutation causes an increase in the rate of mutations,
which accumulate in cells after DNA replication. Expression of a dominant negative allele
of a mismatch repair gene results in impairment of mismatch repair activity, even in the
presence of the wild-type allele.
The MMR process has been shown to be blocked by the use of nonhydrolyzable
forms of ATP (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325-2331; Allen, D.J. et al.
(1997) EMBO J. 16:4467-4476; Bjornson, K.P. et al. (2000) Biochem. 39:3176-3183).
However, it has not been demonstrated that chemicals can block MMR activity in cells.
Such chemicals can be identified by screening cells for defective MMR activity. Cells
from bacteria, yeast, fungi, insects, plants, animals, and humans can be screened for
defective mismatch repair. Genomic DNA, cDNA, or mRNA from any cell can be
analyzed for variations from the wild type sequences in cells or organisms grown in the
presence of MMR blocking compounds. Various techniques of screening can be used.
The suitability of such screening assays, whether natural or artificial, for use in identifying
hypermutable cells, insects, fungi, plants or animals can be evaluated by testing the
mismatch repair activity caused by a compound or a mixture of compounds, to determine
if it is an MMR inhibitor.

A cell, a microbe, or a whole organism such as an insect, fungus, plant or animal that
has been treated with a chemical inhibitor of mismatch repair will become hypermutable.
This means that the spontaneous mutation rate of such cells or whole organism is elevated
compared to cells or animals without such treatment. The degree of elevation of the
spontaneous mutation rate can be at least 2-fold, 5-fold, 10-fold, 20-fold, 50-fold,
100-fold, 200-fold, 500-fold, or 1000-fold that of the normal cell or animal. Chemical
mutagens such as, but not limited to, N-methyl-N'-nitro-N-nitrosoguanidine
(MNNG), methane sulfonate, dimethyl sulfonate, O6-methyl benzadine, ethyl
methanesulfonate (EMS), methylnitrosourea (MNU), ethylnitrosourea (ENU), etc. can be
used in MMR defective cells or whole organisms to increase the mutation rate an additional
10- to 100-fold over that of the MMR deficiency itself.
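
For illustration only, the degree of elevation can be expressed as the ratio of the mutation rate in treated cells to that in untreated cells and compared against the fold levels recited above. The following Python sketch assumes that mutation rates have already been measured by some suitable assay; the function names and example values are hypothetical and do not represent data from the invention.

```python
# Illustrative calculation of fold elevation in spontaneous mutation rate.
# Example rates are hypothetical and are not experimental results.

FOLD_LEVELS = (2, 5, 10, 20, 50, 100, 200, 500, 1000)


def fold_elevation(treated_rate: float, untreated_rate: float) -> float:
    """Ratio of the mutation rate in treated cells to that in untreated cells."""
    if untreated_rate <= 0:
        raise ValueError("untreated mutation rate must be positive")
    return treated_rate / untreated_rate


def highest_level_met(fold: float):
    """Return the largest recited fold level that is met, or None if the
    elevation is below 2-fold."""
    met = [level for level in FOLD_LEVELS if fold >= level]
    return max(met) if met else None


if __name__ == "__main__":
    fold = fold_elevation(treated_rate=300.0, untreated_rate=1.0)
    print(fold, highest_level_met(fold))
```

Under this convention, a treated population whose rate is 300 times that of the untreated control would be described as exhibiting at least a 200-fold elevation.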

According to one aspect of the invention, a screening assay for identifying
chemical inhibitors of MMR is developed and employed. A chemical compound can be in
any form or class ranging from but not limited to amino acid, steroidal, aromatic, or lipid
precursors. The chemical compound can be naturally occurring or made in the laboratory.
The screening assay can be natural, such as looking for altered endogenous repeats within
a host organism's genome (as demonstrated in Figs. 4 and 5), or made in the laboratory
using an MMR-sensitive reporter gene (as demonstrated in Figs. 1-3).
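
As a hedged illustration of the "natural" assay, a shift in an endogenous repeat can be detected by comparing the number of tandem repeat units in a sequence from treated cells against a reference sequence from untreated cells. The Python sketch below assumes the relevant locus has already been amplified and sequenced; the sequences and function names are hypothetical and are not the markers shown in the figures.

```python
import re

# Illustrative detection of a microsatellite shift between a reference
# sequence and a sample sequence. Sequences here are hypothetical.


def longest_repeat_run(sequence: str, unit: str) -> int:
    """Number of tandem copies of `unit` in the longest uninterrupted run."""
    runs = re.findall(f"(?:{re.escape(unit)})+", sequence)
    return max((len(run) // len(unit) for run in runs), default=0)


def repeat_shift(reference: str, sample: str, unit: str) -> int:
    """Change in repeat copy number (negative = contraction, positive =
    expansion, zero = no shift)."""
    return longest_repeat_run(sample, unit) - longest_repeat_run(reference, unit)


if __name__ == "__main__":
    reference = "GGT" + "CA" * 12 + "TTG"  # untreated cells
    sample = "GGT" + "CA" * 10 + "TTG"     # cells grown with a test chemical
    print(repeat_shift(reference, sample, "CA"))
```

A nonzero shift at such a locus in cells grown with a test compound, and not in untreated controls, would be consistent with microsatellite instability and would flag the compound for further evaluation as a candidate MMR inhibitor.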

The chemical compound can be introduced into the cell by supplementing the
growth medium, or by intracellular delivery such as but not limited to using microinjection
or carrier compounds.

According to another aspect of the invention, MMR proficient cells or whole organism
hosts can be exposed to a chemical compound from the anthracene class, and the host
is grown and screened for subtypes containing genetically altered genes with new
biochemical features.

The anthracene compounds that are suitable for use in the invention include, but 
are not limited to anthracenes having the formula: 




wherein R1-R10 are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl,
alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl,
O-alkenyl, S-alkenyl, N-alkenyl, O-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl,
aryloxy, substituted aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl,
alkylaryl, alkylaryloxy, arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl,
guanidino, carboxy, an alcohol,









PCT/US01/00934



an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an aldehyde group, an ester, an ether, a
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid,
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl
groups;

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted
alkynyl, substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower alkyl,
aryl, heteroaryl, aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy
and amino;

and wherein said amino groups are optionally substituted with an acyl group, or 1 to 3 aryl
or lower alkyl groups;

or wherein any two of R1-R10 can together form a polyether;

or wherein any two of R1-R10 can, together with the intervening carbon atoms of the
anthracene core, form a crown ether.

The method of the invention also encompasses inhibiting MMR with an anthracene
of the above formula wherein Rj and R« are hydrogen, and the remaining substituents are as
described above.

In some embodiments, in the anthracene compound R1-R10 are independently
hydrogen, hydroxyl, alkyl, [...] independently hydrogen, hydroxyl, methyl, ethyl, propyl,
isopropyl, [...] tolyl, hydroxymethyl, hydroxypropyl, or hydroxybutyl.

In specific embodiments of the invention the anthracenes include, but are not
limited to, 1,2-dimethylanthracene, 9,10-dimethylanthracene, 7,8-dimethylanthracene,
9,10-diphenylanthracene, 9,10-dihydroxymethylanthracene,
9-hydroxymethyl-10-methylanthracene, [...]-1,2-diol,
9-hydroxymethyl-10-methylanthracene-3[...], 9,10-di-m-tolylanthracene, and
the like.

The chiral position of the side chains of the anthracenes is not particularly limited
and may be any chiral position [...] stereoisomeric forms of the anthracenes and includes
any isomeric analog.
Examples of hosts include, but are not limited to, cells or whole organisms from human,
primate, mammal, rodent, plant, fish, reptiles, amphibians, insects, fungi, yeast or
microbes of prokaryotic origin.

Yet another aspect of the invention is the use of ATP analogs capable of blocking
ATPase activity required for MMR. MMR reporter cells are screened with ATP
compound libraries to identify those compounds capable of blocking MMR in vivo.
Examples of ATP analogs that are useful in blocking MMR activity include, but are not
limited to, nonhydrolyzable forms of ATP such as AMP-PNP and ATP[gamma]S, which block
MMR activity (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325-2331; Allen, D.J. et al.
(1997) EMBO J. 16:4467-4476; Bjornson, K.P. et al. (2000) Biochem. 39:3176-3183).

Yet another aspect of the invention is the use of nuclease inhibitors that are able to
block the exonuclease activity of the MMR biochemical pathway. MMR reporter cells are
screened with nuclease inhibitor compound libraries to identify compounds capable of
blocking MMR in vivo. Examples of nuclease inhibitors that are useful in blocking MMR
activity include, but are not limited to, analogs of N-ethylmaleimide, an endonuclease
inhibitor (Huang, Y.C. et al. (1995) Arch. Biochem. Biophys. 316:485); heterodimeric
adenine-chain-acridine compounds, which are exonuclease III inhibitors (Belmont, P. et al.
(2000) Bioorg. Med. Chem. Lett. 10:293-295); and antibiotic compounds such as
heliquinomycin, which have helicase inhibitory activity (Chino, M. et al. (1998)
J. Antibiot. (Tokyo) 51:480-486).

Another aspect of the invention is the use of DNA polymerase inhibitors that are
able to block the polymerization required for mismatch-mediated repair. MMR reporter
cells are screened with DNA polymerase inhibitor compound libraries to identify those
compounds capable of blocking MMR in vivo. Examples of DNA polymerase inhibitors
that are useful in blocking MMR activity include, but are not limited to, analogs of
actinomycin D (Martin, S.J. et al. (1990) J. Immunol. 145:1859), aphidicolin
(Kuwakado, K. et al. (1993) Biochem. Pharmacol. 46:1909),
1-(2'-deoxy-2'-fluoro-beta-L-arabinofuranosyl)-5-methyluracil (L-FMAU)
(Kukhanova, M. et al. (1998) Biochem. Pharmacol. 55:1181-1187), and
2',3'-dideoxyribonucleoside 5'-triphosphates (ddNTPs) (Ono, K. et al. (1984)
Biomed. Pharmacother. 38:382-389).

In yet another aspect of the invention, antisense oligonucleotides are administered
to cells to disrupt at least one function of the mismatch repair process. The antisense
polynucleotides hybridize to MMR polynucleotides. Both full-length antisense
polynucleotides and antisense polynucleotide fragments are suitable for use. "Antisense
polynucleotide fragments" of the invention include, but are not limited to,
polynucleotides that specifically hybridize to an MMR encoding RNA (as determined by
sequence comparison of nucleotides encoding the MMR protein to nucleotides encoding
other known molecules). Sequences that are substantially unique to MMR-encoding
polynucleotides can be identified by analysis of any publicly available sequence database
and/or with any commercially available sequence comparison program. Antisense
molecules may be generated by any means including, but not limited to, chemical
synthesis, expression in an in vitro transcription reaction, expression in a transformed
cell comprising a vector that may be transcribed to produce antisense molecules,
restriction digestion and isolation, the polymerase chain reaction, and the like.

Antisense oligonucleotides, or fragments thereof, may include the nucleotide
sequences set forth in SEQ ID NOs:15, 17, 19, 21, 23, 25, 27, and 29 or sequences
complementary or homologous thereto, for example. Those of skill in the art recognize
that the invention may be practiced using any MMR gene. Specifically, antisense nucleic
acid molecules comprise a sequence complementary to at least about 10, 15, 25, 50, 100,
250 or 500 nucleotides or an entire MMR encoding sequence. Preferably, the antisense
oligonucleotides comprise a sequence complementary to about 15 consecutive nucleotides
of the coding strand of the MMR encoding sequence.

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding
region" of the coding strand of a nucleotide sequence encoding an MMR protein. The
coding strand may also include regulatory regions of the MMR sequence. The term
"coding region" refers to the region of the nucleotide sequence comprising codons which
are translated into amino acid residues (e.g., the protein coding region of human PMS2
corresponds to the coding region SEQ ID NO:17). In another embodiment, the antisense
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a
nucleotide sequence encoding an MMR protein. The term "noncoding region" refers to 5'
and 3' sequences which flank the coding region and are not translated into amino acids
(also referred to as 5' and 3' untranslated regions (UTR)).

Preferably, antisense oligonucleotides are directed to regulatory regions of a
nucleotide sequence encoding an MMR protein, or mRNA corresponding thereto,
including, but not limited to, the initiation codon, TATA box, enhancer sequences, and the
like. Given the coding strand sequences provided herein, antisense nucleic acids of the
invention can be designed according to the rules of Watson and Crick or Hoogsteen base
pairing. The antisense nucleic acid molecule can be complementary to the entire coding
region of an MMR mRNA, but more preferably is an oligonucleotide that is antisense to
only a portion of the coding or noncoding region of an MMR mRNA. For example, the
antisense oligonucleotide can be complementary to the region surrounding the translation
start site of an MMR mRNA. An antisense oligonucleotide can be, for example, about 5,
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length.
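
As a rough illustration of the Watson-Crick design rule described above, the following
Python sketch builds an antisense oligonucleotide against a window surrounding a
translation start site. The function name antisense_oligo, the 15-nucleotide default
length, and the toy mRNA are assumptions made for illustration only; the toy sequence
is not one of the SEQ ID NOs of the invention.

```python
# Hypothetical helper: design an antisense oligonucleotide against the region
# surrounding a translation start site. The mRNA used below is a made-up toy
# sequence, not taken from any SEQ ID NO.

COMPLEMENT = str.maketrans("ACGU", "UGCA")  # Watson-Crick pairing for RNA


def antisense_oligo(mrna: str, start_codon_index: int, length: int = 15) -> str:
    """Return an RNA oligo (written 5'->3') complementary to a window of
    `length` nucleotides roughly centred on the start codon of `mrna`."""
    mrna = mrna.upper()
    if mrna[start_codon_index:start_codon_index + 3] != "AUG":
        raise ValueError("start_codon_index does not point at an AUG codon")
    centre = start_codon_index + 1
    # Clamp the window so it stays inside the transcript.
    left = max(0, centre - length // 2)
    right = min(len(mrna), left + length)
    left = max(0, right - length)
    target = mrna[left:right]
    # Complement each base, then reverse so the oligo reads 5'->3'.
    return target.translate(COMPLEMENT)[::-1]


toy_mrna = "GGCUAGCCACCAUGGAGCGAGCUGAGAGC"
oligo = antisense_oligo(toy_mrna, toy_mrna.index("AUG"))
print(oligo)
```

The window length can be changed through the length argument to cover the other
oligonucleotide sizes mentioned above.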
Screening is any process whereby a chemical compound is exposed to a cell or
whole organism. The process of screening can be carried out using, but is not limited to, a
whole animal, plant, insect, microbe, or a suspension of one or more isolated cells
in culture. The cell can be any type of eukaryotic or prokaryotic cell, including, for
example, cells isolated from humans or other primates, mammals or other vertebrates,
invertebrates, and single celled organisms such as protozoa, yeast, or bacteria.

In general, screening will be carried out using a suspension of cells, or a single cell,
but other methods can also be applied as long as a sufficient fraction of the treated cells or
tissue is exposed so that isolated cells can be grown and utilized. Techniques for chemical
screening are well known to those in the art. Available techniques for screening include
cell-based assays, molecular assays, and whole organism-based assays. Compounds can
be added to the screening assays of the invention in order to identify those agents that are
capable of blocking MMR in cells.

The screening assays of the invention provide a system wherein a cell, cells or a
whole organism is contacted with a candidate compound and then tested to determine
whether mismatch repair has been adversely affected. The method by which MMR is
analyzed may be any known method, including, but not limited to, analysis of the
molecular sequence of the MMR gene and analysis of endogenous repeats in the subject's
genome. Further, the invention provides a convenient assay to analyze the effects of
candidate agents on reporter genes transfected into cells.
MMR inhibitors identified by the methods of the invention can be used to generate
new mutations in one or more gene(s) of interest. A gene of interest can be any gene
naturally possessed by a cell line, microbe or whole organism. An advantage of using
chemicals rather than recombinant technologies to block MMR is that the process is
faster: there is no need to produce stable clones with a knocked-out MMR gene or a clone
expressing a dominant negative MMR gene allele. Another advantage is that host
organisms need not be screened for integrated knockout targeting vectors or stable
expression of a dominant negative MMR gene allele. Finally, once a cell, plant or animal
has been exposed to the MMR-blocking compound and a new output trait is generated, the
MMR process can be restored by removal of the compound. Mutations can be detected by
analyzing the genotype of the cell, or whole organism, for example, by examining the
sequence of genomic DNA, cDNA, messenger RNA, or amino acids associated with the
gene of interest. Mutations can also be detected by screening for new output traits such as
hypoxanthine-guanine phosphoribosyltransferase [...]. A mutant
polypeptide can be detected by identifying alterations in electrophoretic mobility,
spectroscopic properties, or other physical or structural characteristics of a protein encoded
by a mutant gene. One can also screen for altered function of the protein in situ, in isolated
form, or in model systems. One can screen for alteration of any property of the cell, plant
or animal associated with the function of the gene of interest.

Several advantages exist in generating genetic mutations by blocking MMR in vivo
in contrast to [...]. Cells with
MMR deficiency have a wide range of mutations dispersed throughout their entire genome,
in contrast to cells treated with DNA damaging agents such as MNNG, MNU, EMS and
ionizing radiation. Another advantage is that mutant cells that arise from MMR deficiency
are diploid in nature and do not lose large segments of chromosomes, as is the case with
DNA damaging agents such as EMS, MNU, and ionizing radiation (Honma, M. et al. (1997)
Mutat. Res. 374:89-98). This unique feature allows for subtle genetic changes throughout
a host's genome, yielding genetically stable hosts with commercially important output
traits.

The invention also encompasses blocking MMR in vivo and in vitro and further
exposing the cells or organisms to a chemical mutagen in order to increase the incidence of
genetic mutation.

The invention also encompasses withdrawing exposure to inhibitors of mismatch
repair once a desired mutant genotype or phenotype is generated such that the mutations
are thereafter maintained in a stable genome.

The above disclosure generally describes the present invention. A more complete
understanding can be obtained by reference to the following specific examples, which are 
provided herein for purposes of illustration only, and are not intended to limit the scope of 
the invention. 

EXAMPLES

EXAMPLE 1: Generation of a cell-based screening assay to identify chemicals 
capable of inactivating mismatch repair in vivo. 

A hallmark of MMR deficiency is the generation of unstable microsatellite repeats
in the genome of host cells (Peinado, M.A. et al. (1992) Proc. Natl. Acad. Sci. USA
89:10065-10069; Strand, M. et al. (1993) Nature 365:274-276; Parsons, R. et al. (1993)
Cell 75:1227-1236). This phenotype is referred to as microsatellite instability (MI) (Harfe,
B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich, P. (1994)
Science 266:1959-1960; Peinado, M.A. et al. (1992) Proc. Natl. Acad. Sci. USA
89:10065-10069; Perucho, M. (1996) Biol. Chem. 377:675-684; Hoang, J.M. et al. (1997)
Cancer Res. 57:300-303; Strand, M. et al. (1993) Nature 365:274-276). MI consists of
deletions and/or insertions within repetitive mono-, di- and/or tri-nucleotide sequences
throughout the entire genome of a host cell. Extensive genetic analysis of eukaryotic cells
has found that the only biochemical defect that is capable of producing MI is defective
MMR (Harfe, B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich,
P. (1994) Science 266:1959-1960; Peinado, M.A. et al. (1992) Proc. Natl. Acad. Sci. USA
89:10065-10069; Perucho, M. (1996) Biol. Chem. 377:675-684; Hoang, J.M. et al. (1997)
Cancer Res. 57:300-303; Strand, M. et al. (1993) Nature 365:274-276). In light of this
unique effect that defective MMR has on promoting microsatellite instability,
endogenous MI is now used as a biochemical marker to survey for lack of MMR activity
within host cells (Hoang, J.M. et al. (1997) Cancer Res. 57:300-303).

A method used to detect MMR deficiency in eukaryotic cells is to employ a
reporter gene that has a polynucleotide repeat inserted within the coding region that
disrupts its reading frame due to a frameshift. In the case where MMR is defective, the
reporter gene will acquire random mutations (i.e., insertions and/or deletions) within the
polynucleotide repeat, yielding clones that contain a reporter with an open reading frame.
This reporter gene can be of any biochemical pathway such as, but not limited to,
β-glucuronidase, β-galactosidase, neomycin resistance gene, hygromycin resistance gene,
green fluorescent protein, and the like. A schematic diagram of MMR-sensitive reporters
is shown in Fig. 1, where the polynucleotide repeat can consist of mono-, di-, tri- or
tetra-nucleotides. We have employed a β-galactosidase MMR-sensitive reporter gene
to measure MMR activity in H36 cells, which are a murine hybridoma cell line. The
reporter construct used is called pCAR-OF, which contains a hygromycin resistance
(HYG) gene plus a β-galactosidase gene with a 29 bp out-of-frame poly-CA tract inserted
at the 5' end of its coding region. The pCAR-OF reporter cannot generate β-galactosidase
activity unless a frame-restoring mutation (i.e., insertion or deletion) arises following
transfection. This line has been shown to be sensitive to inactivated MMR: introducing a
dominant negative MMR gene allele results in the production of
β-galactosidase (unpublished data). An example of these data using the dominant
negative PMS134 allele is shown in Table 1. Briefly, H36 cells were each transfected with
an expression vector containing the PMS134 allele (referred to as HB134) or empty vector
and the pCAR-OF vector in duplicate reactions using the protocol below. The PMS134
gene is cloned into the pEF expression vector, which contains the elongation factor
promoter upstream of the cloning site followed by a mammalian polyadenylation signal.
This vector also contains the NEOr gene that allows for selection of cells in G418 to
identify those retaining this plasmid. Briefly, cells were transfected with 1 µg of the
PMS134 or empty vector using polyliposomes following the manufacturer's protocol (Life
Technologies). Cells were then selected in 0.5 mg/ml of G418 for 10 days and G418
resistant cells were pooled together to analyze for gene expression. PMS134 positive cells,
which were determined by RT-PCR and western blot (not shown), were expanded and
transfected with the pCAR-OF reporter gene that contains a hygromycin (HYG) resistance
gene as reporter using the protocol described above. Cells were selected in 0.5 mg/ml
G418 and 0.5 mg/ml HYG to select for cells retaining both the MMR effector and the
pCAR-OF reporter plasmids. All cultures transfected with the pCAR vector resulted in a
similar number of HYG/G418 resistant cells. Cultures were then expanded and tested for
β-galactosidase activity in situ as well as by biochemical analysis of cell extracts. For in
situ analysis, 100,000 cells were harvested and fixed in 1% glutaraldehyde, washed in
phosphate buffered saline solution and incubated in 1 ml of X-gal substrate solution [0.15
M NaCl, 1 mM MgCl2, 3.3 mM K4Fe(CN)6, 3.3 mM K3Fe(CN)6, 0.2% X-Gal] in 24-well
plates for 2 hours at 37°C. Reactions were stopped in 500 mM sodium bicarbonate
solution and transferred to microscope slides for analysis. Three fields of 200 cells each
were counted for blue (β-galactosidase positive cells) or white (β-galactosidase negative
cells) to assess for MMR inactivation. Table 1 shows the results from these studies.
While no β-galactosidase positive cells were observed in H36 empty vector cells, 10%
of the cells per field were β-galactosidase positive in HB134 cultures.

Table 1. β-galactosidase expression of H36 empty vector and HB134 cells transfected
with pCAR-OF reporter vectors. Cells were transfected with the pCAR-OF reporter
plasmid. Transfected cells were selected in HYG and G418, expanded and stained with
X-gal solution to measure β-galactosidase activity (blue colored cells). Three fields of 200
cells each were analyzed by microscopy. The results below represent the mean +/-
standard deviation of these experiments.

CELL LINE            # BLUE CELLS
H36 empty vector     0 +/- 0
HB134                20 +/- 3


Cultures can be further analyzed by biochemical assays using cell extracts to measure
β-galactosidase activity as previously described (Nicolaides, N.C. et al. (1998) Mol. Cell.
Biol. 18:1635-1641).

The data described in Table 1 show that inhibiting the MMR activity of an
MMR proficient host cell can result in MI, and that the altering of microsatellites in the
pCAR-OF vector results in cells that produce functional β-galactosidase enzyme. The
H36pCAR-OF cell line can now be used to screen for chemicals that are able to block
MMR of the H36 cell line.
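
The frame-restoration logic behind the pCAR-OF reporter can be sketched with a few
lines of Python. This is only an illustration under a stated assumption: that the
β-galactosidase coding sequence is in frame whenever the inserted tract length is a
multiple of 3. The 29 bp tract length comes from the text; the function names and
the range of length changes examined are hypothetical.

```python
# Sketch: which length changes in an out-of-frame repeat tract restore the
# reading frame? Assumes (for illustration) that the downstream coding
# sequence is in frame whenever the tract length is a multiple of 3.

TRACT_LENGTH = 29  # bp, the poly-CA tract length stated for pCAR-OF


def in_frame(tract_length: int) -> bool:
    """True when the tract does not shift the downstream reading frame."""
    return tract_length % 3 == 0


def restoring_changes(tract_length: int, changes: range) -> list[int]:
    """Return the insertions (+) or deletions (-) in bp that restore the frame."""
    return [d for d in changes if d != 0 and in_frame(tract_length + d)]


restoring = restoring_changes(TRACT_LENGTH, range(-6, 7))
# Slippage in a dinucleotide (CA) repeat would be expected to change the
# tract length in steps of 2 bp, so keep only the even changes.
dinucleotide_steps = [d for d in restoring if d % 2 == 0]
print(restoring, dinucleotide_steps)
```

Under this assumption, loss of a single CA unit (a 2 bp deletion) is the smallest
repeat-unit change that would restore β-galactosidase expression.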



EXAMPLE 2: Screening assays for identifying chemical blockers of MMR.

A method for screening chemical libraries is provided in this example using the
H36pCAR-OF cell line described in Example 1. This cell line is a hardy, stable line that
can be formatted into 96-well microtiter plates for automated screening for chemicals that
specifically block MMR. An overview of the screening process is given in Figure 2;
however, the process is not limited to the specifications within this example. Briefly,
10,000 cells in a total volume of 0.1 ml of growth medium (RPMI 1640 plus 10% fetal
bovine serum) are added to 96-well microtiter plates containing any variety of chemical
compounds. Cells are grown for 14-17 days at 37°C in 5% CO2. Cells are then lysed in the
growth medium with 50 µl of lysis buffer containing 0.1 M Tris buffer (pH 8.0), 0.1%
Triton X-100, 45 mM 2-mercaptoethanol, 1 mM MgCl2, 0.1 M NaPO4 and 0.6 mg/ml
chlorophenol red-β-D-galactopyranoside (CPRG, Roche). Reactions are incubated for 1
hour, terminated by the addition of 50 µl of 0.5 M Na2CO3, and analyzed by
spectrophotometry at 576 nm.

Experimental wells are compared to untreated or vehicle treated wells to identify
those with increased β-galactosidase activity. Compounds producing MMR blocking
activity are then further analyzed using different cell lines containing the pCAR-OF
plasmid to measure the ability to block MMR, as determined by MI in MMR proficient
hosts, by analyzing endogenous microsatellites for instability using assays described below.
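
The comparison of experimental wells with vehicle treated wells can be sketched as
follows. The text does not specify a hit threshold, so the cutoff used here (vehicle
mean plus three standard deviations of the absorbance at 576 nm) is an assumption, as
are the function name call_hits and all absorbance values, which are made up for
illustration.

```python
from statistics import mean, stdev


def call_hits(vehicle_a576, treated_a576, n_sd=3.0):
    """Return the names of wells whose A576 exceeds vehicle mean + n_sd * SD.

    The mean + 3 SD rule is an assumed convention, not taken from the text.
    """
    cutoff = mean(vehicle_a576) + n_sd * stdev(vehicle_a576)
    return sorted(well for well, a576 in treated_a576.items() if a576 > cutoff)


# Made-up absorbance readings for illustration only.
vehicle = [0.10, 0.12, 0.11, 0.09, 0.10, 0.11]
treated = {"B2": 0.11, "B3": 0.45, "B4": 0.13, "B5": 0.09}

hits = call_hits(vehicle, treated)
print(hits)
```

A stricter or looser n_sd would trade false positives against missed compounds;
any hit would still need the follow-up analysis described above.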

EXAMPLE 3: Defining MMR blocking chemicals. 

Identifying chemical inhibitors of MMR can be difficult because standard mutagens
must be distinguished from compounds that induce genomic instability via the
blockade of MMR. This Example teaches a method for distinguishing blockers of MMR
from more general mutagens. Once a compound has been identified in the assay
described above, one can determine if the compound is a general mutagen or a specific
MMR blocker by monitoring mutation rates in MMR proficient cells and a controlled
subclone that is MMR defective. One feature of MMR deficiency is the increased
resistance to toxicity of DNA alkylating agents that allows for enhanced rates of mutations
upon mutagen exposure (Liu, L. et al. (1996) Cancer Res. 56:5375-5379). This unique
feature allows for the use of an MMR proficient cell and a controlled line to measure
the enhanced activity of a chemical compound to induce mutations in MMR proficient vs
MMR deficient lines. If the compound is a true inhibitor of MMR, then genetic mutations
should occur in MMR proficient cells while no "enhanced" mutation rate will be found in
already MMR defective cells. Using these criteria, chemicals such as ICR191, which
induces frameshift mutations in mammalian cells, would not be considered an MMR
blocking compound because of its ability to produce enhanced mutation rates in already
MMR defective cell lines (Chen, W.D. et al. (2000) J. Natl. Cancer Inst. 92:480-485).
These screening lines include, but are not limited to, those in which a dominant negative
MMR gene has been introduced, such as that described in EXAMPLE 1, or those in which
naturally MMR deficient cells such as HCT116 have been cured by introduction of a
complementing MMR gene as described (Chen, W.D. et al. (2000) J. Natl. Cancer Inst.
92:480-485).
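
The decision rule of this Example can be written out as a small Python sketch. The
function name classify_compound and the 2-fold threshold are hypothetical; the text
gives only the qualitative criterion that a true MMR inhibitor raises the mutation
rate in MMR proficient cells but not in already MMR defective cells.

```python
def classify_compound(fold_in_proficient: float, fold_in_deficient: float,
                      threshold: float = 2.0) -> str:
    """Classify a compound from fold-increases in mutation rate over vehicle.

    fold_in_proficient: fold change measured in the MMR proficient line.
    fold_in_deficient:  fold change measured in the matched MMR defective line.
    threshold:          assumed cutoff for calling a rate "enhanced".
    """
    raises_proficient = fold_in_proficient >= threshold
    raises_deficient = fold_in_deficient >= threshold
    if raises_deficient:
        # Enhanced mutation in cells that already lack MMR points to a
        # general mutagen (the behaviour described for ICR191).
        return "general mutagen"
    if raises_proficient:
        return "candidate MMR blocker"
    return "inactive"


# Made-up fold changes for illustration only.
print(classify_compound(5.0, 1.1))
print(classify_compound(5.0, 4.0))
print(classify_compound(1.0, 1.2))
```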

EXAMPLE 4: Identification of chemical inhibitors of MMR in vivo. 
MMR is a conserved post-replicative DNA repair mechanism that repairs point
mutations and insertion/deletions in repetitive sequences after cell division. MMR
requires an ATPase activity for initiation complex recognition and DNA translocation. In
vitro assays have shown that nonhydrolyzable forms of ATP such as AMP-PNP
and ATP[gamma]S block MMR activity (Galio, L. et al. (1999) Nucl. Acids Res.
27:2325-2331; Allen, D.J. et al. (1997) EMBO J. 16:4467-4476; Bjornson, K.P. et al.
(2000) Biochem. 39:3176-3183).

The use of chemicals to inhibit endogenous MMR in vivo has not been
distinguished in the public domain. In an attempt to identify chemicals that can inhibit
MMR in vivo, we used our H36pCAR-OF screening assay to screen for chemicals that are
able to cause microsatellite instability and restoration of β-galactosidase activity from the
pCAR-OF vector, an effect that can only be caused by MMR deficiency. In our
screening assays we used a variety of classes of compounds ranging from steroids such as
ponasterone, to potent alkylating agents such as EMS, to kinase and other enzyme
inhibitors. Screens identified one class of chemicals that was capable of generating
β-galactosidase positive cells. These molecules were derived from the anthracene class. An
example of one such anthracene derivative for the purposes of this application is a
molecule called 9,10-dimethylanthracene, referred to from here on as DMA. Fig. 3 shows
the effect of DMA in shifting the pCAR-OF reporter plasmid. In contrast, general DNA
alkylating agents such as EMS or MNNG did not result in MI and/or the shifting of the
polynucleotide tract in the pCAR-OF reporter.

The most likely explanation for the differences in β-galactosidase activity was that
the DMA compound disturbed MMR activity, resulting in a higher frequency of mutation
within the pCAR-OF reporter and re-establishing the ORF. To directly test the hypothesis
that MMR was altered, we employ a biochemical assay for MMR with the individual
clones as described (Nicolaides, N.C. et al. (1998) Mol. Cell.
Biol. 18:1635-1641). Nuclear extracts are prepared from the clones and incubated with
heteroduplex substrates containing either a /CA\ insertion-deletion or a G/T mismatch
under conditions described previously. The /CA\ and G/T heteroduplexes are used to test
repair from the 3' and 5' directions, respectively, as described (Nicolaides, N.C. et al.
(1998) Mol. Cell. Biol. 18:1635-1641).

Biochemical assays for mismatch repair.

Enzymatic Repair Assays:

MMR activity in nuclear extracts is measured as described, using 24 fmol of
substrate (Nicolaides, N.C. et al. (1998) Mol. Cell. Biol. 18:1635-1641).
Complementation assays are done by adding ~100 ng of purified MutLα or MutSα
components to 100 µg of nuclear extract, adjusting the final KCl concentration to 100 mM
(Nicolaides, N.C. et al. (1998) Mol. Cell. Biol. 18:1635-1641). The substrates used in
these experiments contain a strand break 181 nucleotides 5' or 125 nucleotides 3' to the
mismatch.

Biochemical Activity Assays:

To demonstrate the direct effect of small molecules on MMR proteins, molecular
assays such as mismatch binding and MMR complex formation are performed in the
presence or absence of drug. Briefly, MMR gene cDNAs are PCR amplified using primers
encompassing the entire coding regions of the known MMR proteins MSH2 (SEQ ID
NO:20), GTBP (SEQ ID NO:26), MLH1 (SEQ ID NO:22), human PMS2 (SEQ ID
NO:16), mouse PMS2 (SEQ ID NO:14), PMS1 (SEQ ID NO:18), and MSH3 (SEQ ID
NO:28) from any species with a sense primer containing a T7 promoter and a Kozak
translation signal as previously described (Nicolaides, N.C. et al. (1998) Mol. Cell. Biol.
18:1635-1641). The coding regions of known MMR proteins include the sequences shown
in Table 3 for mouse PMS2 (SEQ ID NO:15), human PMS2 (SEQ ID NO:17), human
PMS1 (SEQ ID NO:19), human MSH2 (SEQ ID NO:21), human MLH1 (SEQ ID NO:23),
and human MSH3 (SEQ ID NO:29). Products are transcribed and translated using the
TNT system (Promega). Examples of PCR primers and in vitro transcription-
translation reactions are listed below.



In vitro transcription-translation: 

Linear DNA fragments containing hPMS2 (SEQ ID NO:17) and hMLH1 (SEQ ID
NO:23) cDNA sequences were prepared by PCR, incorporating sequences for in vitro
transcription and translation in the sense primer. A full-length hMLH1 fragment was
prepared using the sense primer
5'-ggatcctaatacgactcactatagggagaccaccatgtcgttcgtggcaggg-3' (SEQ ID NO:1) (codons 1-6)
and the antisense primer 5'-taagtcttaagtgctaccaac-3' (SEQ ID NO:2) (located in the 3'
untranslated region, nt 2411-2433), using a wild-type hMLH1 cDNA clone as template. A
full-length hPMS2 fragment was prepared with the sense primer
5'-ggatcctaatacgactcactatagggagaccaccatggaacaattgcctgcgg-3' (SEQ ID NO:3) (codons 1-6)
and the antisense primer 5'-aggttagtgaagactctgtc-3' (SEQ ID NO:4) (located in the 3'
untranslated region, nt 2670-2690) using a cloned hPMS2 cDNA as template. These
fragments were used to produce proteins via the coupled transcription-translation system
(Promega). The reactions were supplemented with 35S-labelled methionine or unlabelled
methionine. Lower molecular weight bands are presumed to be degradation products
and/or polypeptides translated from alternative internal methionines.
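
The layout of the sense primers (T7 promoter, Kozak signal, then the first codons of
the coding region) can be checked with a short Python sketch. It assumes the standard
genetic code and the T7 promoter sequence taatacgactcactataggg, and it reads the
Kozak region of the hMLH1 primer as "gagaccaccatg" by analogy with the hPMS2 primer;
that reading, and the helper name codons_after_promoter, are assumptions made for
illustration.

```python
from itertools import product

# Standard genetic code, built in the conventional t, c, a, g ordering.
BASES = "tcag"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

T7_PROMOTER = "taatacgactcactataggg"


def codons_after_promoter(primer: str) -> str:
    """Translate the primer from the first atg found after the T7 promoter."""
    primer = primer.lower()
    t7_end = primer.index(T7_PROMOTER) + len(T7_PROMOTER)
    atg = primer.index("atg", t7_end)
    coding = primer[atg:]
    usable = len(coding) - len(coding) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[coding[i:i + 3]] for i in range(0, usable, 3))


# hMLH1 sense primer, with the Kozak region read as described in the lead-in.
hmlh1_sense = "ggatcctaatacgactcactatagggagaccaccatgtcgttcgtggcaggg"
peptide = codons_after_promoter(hmlh1_sense)
print(peptide)
```

The six translated residues correspond to the "codons 1-6" annotation given for
this primer.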
To study the effects of MMR inhibitors, assays are used to measure the formation
of MLH1 and PMS2 complexes with or without compound, using polypeptides produced in
the TNT System (Promega) followed by immunoprecipitation (IP). To facilitate the IP,
tags may be placed at the C-terminus of the PMS2 protein for antibody binding, or
antibodies directed to the MMR protein itself can be used for IP.
Immunoprecipitations:

Immunoprecipitations are performed on in vitro translated proteins by mixing the
translation reactions with 1 µg of the MLH1 specific monoclonal antibody (mAB) MLH14
(Oncogene Science, Inc.), a polyclonal antibody generated to codons 2-20 of hPMS2
described above, or a polyclonal antibody generated to codons 843-862 of hPMS2 (Santa
Cruz Biotechnology, Inc.) in 400 µl of EBC buffer (50 mM Tris, pH 7.5, 0.1 M NaCl,
0.5% NP40). After incubation for 1 hr at 4°C, protein A sepharose (Sigma) is added to a
final concentration of 10% and reactions are incubated at 4°C for 1 hour. Proteins bound
to protein A are washed five times in EBC and separated by electrophoresis on 4-20%
Tris-glycine gels, which are then dried and autoradiographed.

Compounds that block heterodimerization of mutS or mutL proteins can now be
identified using this assay.

EXAMPLE 5: Use of chemical MMR inhibitors yields microsatellite instability in
human cells

In order to demonstrate the global ability of a chemical inhibitor of MMR in host
cells and organisms, we treated human HEK293 cells (referred to as 293 cells) with DMA
and measured microsatellite instability of endogenous loci using the BAT26 diagnostic
marker (Hoang, J.M. et al. (1997) Cancer Res. 57:300-303). Briefly, 10^5 cells were grown
in control medium or 250 µM DMA, a concentration that is found to be non-toxic, for 14
to 17 days. Cells are then harvested and genomic DNA isolated using the salting-out
method (Nicolaides, N.C. et al. (1991) Mol. Cell. Biol. 11:6166-6176).
Various amounts of test DNAs from HCT116 (a human colon epithelial cell line) 
and 293 were first used to determine the sensitivity of our microsatellite test. The BAT26 
alleles are known to be heterogeneous between these two cell lines and the products 
migrate at different molecular weights (Nicolaides, personal observation). DNAs were 
diluted by limiting dilution to determine the level of sensitivity of the assay. DNAs were 
PCR amplified using the BAT26F: 5'-tgactacttttgacttcagcc-3' (SEQ ID NO:43) and the 
BAT26R: 5'-aaccattcaacatttttaaccc-3' (SEQ ID NO:44) primers in buffers as described 
(Nicolaides, N.C. et al. (1995) Genomics 30:195-206). Briefly, 1 pg to 100 ng of DNA 
was amplified using the following conditions: 94°C for 30 sec, 58°C for 30 sec, 72°C for 
30 sec for 30 cycles. PCR reactions were electrophoresed on 12% polyacrylamide TBE 
gels (Novex) or 4% agarose gels and stained with ethidium bromide. These studies found 
that 0.1 ng of genomic DNA was the limit of detection using our conditions. 
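
As a rough illustration of what this detection limit means in template terms, the sketch below converts a mass of genomic DNA into an approximate number of BAT26 template copies. The figure of about 6.6 pg of DNA per diploid human cell is a commonly quoted approximation and is an assumption here, not a value from this specification; the function name is hypothetical, and the calculation ignores the abnormal ploidy of cultured lines such as 293.

```python
# Assumption: a diploid human cell carries roughly 6.6 pg of genomic DNA.
# This value is a textbook approximation, not taken from the specification.
PG_PER_DIPLOID_CELL = 6.6


def template_copies(dna_ng: float, alleles_per_cell: int = 2) -> float:
    """Approximate copies of a single-copy locus contained in dna_ng of genomic DNA."""
    cell_equivalents = dna_ng * 1000.0 / PG_PER_DIPLOID_CELL  # ng -> pg -> cells
    return cell_equivalents * alleles_per_cell


if __name__ == "__main__":
    # Dilution series spanning the 1 pg to 100 ng range used in the sensitivity test.
    for ng in (100.0, 1.0, 0.1, 0.001):
        print(f"{ng:>8} ng ~ {template_copies(ng):,.1f} template copies")
```

Under these assumptions, 0.1 ng corresponds to only a few tens of template copies per reaction, which is consistent with running many replicate reactions at that input in order to reveal minor alleles.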

To measure microsatellite stability in 293 cells grown with or without DMA, 
0.1 ng of DNA from DMA-treated or control 293 cells was amplified using the reaction 
conditions above. Forty individual reactions were carried out for each sample to measure 
for minor alleles. Fig. 4A shows a typical result from these studies whereby BAT26 
alleles were amplified from DMA-treated and untreated cells and analyzed on 12% PAGE 
gels (Novex). Alleles from DMA-treated cells showed the presence of an altered allele 
(asterisk) that migrated differently from the wild type allele. No altered alleles were found 
in the MMR-proficient control cells, as expected, since MI only occurs in MMR-defective 
cell hosts. To confirm these data, PCRs were repeated using isolated BAT26 products. 
Primers and conditions were the same as described above except that reactions were 
amplified for 20 cycles. PCR products were gel-purified and cloned into T-tailed vectors 
(InVitrogen) as suggested by the manufacturer. Recombinant clones from DMA-treated 
and control cells were screened by PCR, again using the BAT26 primers. Fifty bacterial 
colonies were analyzed for BAT26 structure by directly adding an aliquot of live bacteria 
to the PCR mix. PCR reactions were carried out as described above, and products were 
electrophoresed on 4% agarose gels and stained with ethidium bromide. As shown in 
Figure 4B, microsatellites from DMA-treated cells had alterations (asterisks) that made the 
marker length larger or smaller than the wild type allele found in control cells. 

To confirm that these differences in molecular weight were due to shifts within the 
polynucleotide repeat, a hallmark of defective MMR, five clones from each sample were 
sequenced using an ABI automated sequencer with an M13-R primer located in the T-tail 
vector backbone. Sequence analysis revealed that the control cell clone used in our studies 
was homozygous for the BAT26 allele with a 26 nt polyA repeat. Cells treated with DMA 
were found to carry multiple alleles, ranging from the wild-type with a 26 nt polyA repeat to 
shorter alleles (24 nt polyA repeat) and larger alleles (28 nt polyA repeat) (Fig. 5). 
These data corroborate the H36pCAR data in Example 1 and Fig. 3 and 
demonstrate the ability to block MMR with a chemical in a range of hosts. 
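
The allele calls described above amount to measuring the length of the polyA tract in each sequenced clone and comparing it with the 26 nt wild-type tract. A minimal sketch of that comparison follows; the function names are hypothetical and the flanking bases in the example reads are placeholders rather than the real BAT26 flanks.

```python
import re

WILD_TYPE_LENGTH = 26  # polyA tract length reported for the control 293 clone


def poly_a_length(read: str) -> int:
    """Length of the longest uninterrupted run of A in a read (0 if none)."""
    runs = re.findall(r"A+", read.upper())
    return max((len(run) for run in runs), default=0)


def classify(read: str, wild_type: int = WILD_TYPE_LENGTH) -> str:
    """Label a clone as wild-type, contraction, or expansion by its polyA length."""
    length = poly_a_length(read)
    if length == wild_type:
        return "wild-type"
    return "contraction" if length < wild_type else "expansion"


if __name__ == "__main__":
    # Hypothetical clone reads; "GGT" and "GGC" are placeholder flanks.
    clones = {
        "control": "GGT" + "A" * 26 + "GGC",
        "dma_1": "GGT" + "A" * 24 + "GGC",
        "dma_2": "GGT" + "A" * 28 + "GGC",
    }
    for name, read in clones.items():
        print(name, poly_a_length(read), classify(read))
```

Taking the longest A run is a simplification that assumes the repeat is the longest homopolymer in the read, which holds for the placeholder reads but would need checking against real flanking sequence.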



EXAMPLE 6: Chemical inhibitors of MMR generate DNA hypermutability in plants 
and new phenotypes. 

To determine if chemical inhibitors of MMR work across a diverse array of 
organisms, we explored the activity of DMA on Arabidopsis thaliana (AT), a member of 
the mustard plant family, as a plant model system to study the effects of DMA on 
generating MMR deficiency, genome alterations, and new output traits. 

Briefly, AT seeds were sterilized with straight commercial bleach and 100 seeds 
were plated in 100 mm Murashige and Skoog (MS) phytagar (Life Technology) plates with 
increasing amounts of DMA (ranging from 100 µM to 50 mM). A similar number of seeds 
was plated on MS phytagar only or in MS phytagar with increasing amounts of EMS 
(100 µM to 50 mM), a mutagen commonly used to mutate AT seeds (McCallum, C.M. et al. 
(2000) Nat. Biotechnol. 18:455-457). Plates were grown in a temperature-controlled, 
fluorescent-lighted humidifier (Percival Growth Chamber) for 10 days. After 10 days, 
seeds were counted to determine toxicity levels for each compound. Table 2 shows the 
number of healthy cells/treatment as determined by root formation and shoot formation. 
Plantlets that were identical to untreated seeds were scored healthy. Seeds with stunted 
root or shoot formation were scored intermediate (inter). Non-germinated seeds were 
scored dead. 

Table 2: Toxicity curve of DMA and EMS on Arabidopsis (per 100 cells) 

Healthy   Inter   Dead 
20   15   32   85   100   100 

Healthy 99   Inter   Dead 
54   75   100   100   100   100   0   100   87 



The data in Table 2 show that DMA toxicity occurs at 10 mM of continuous 
culture, while toxicity occurs at 250 µM for EMS. Next, 50 seeds were plated in two 
150 mm dishes containing 2 mM DMA, 250 µM EMS, or no drug. Seeds were grown for 10 
days and then 10 plants from each plate were transferred to soil. All plants appeared to be 
similar in color and height. Plants were grown at room temperature with daily cycles of 18 
hr light and 6 hr dark. After 45 days, seeds were harvested from siliques and stored in a 
desiccator at 4°C for 72 hours. Seeds were then sterilized and 100 seeds from each plant were 
sown directly into water-saturated soil and grown at room temperature with daily cycles of 
18 hr light and 6 hr dark. At day 10, phenotypically distinct plants were found in 7 out of 
118 DMA-treated plants, while no phenotypic difference was observed in 150 EMS-treated or 150 
control plants. These 7 altered plants were light green in color and appeared to grow 
slower. Fig. 6 shows a typical difference between the DMA altered plant and controls. 
DMA-exposed plants produced offspring that were yellow in appearance in contrast to 
dark green, which is always found in wild-type plants. In addition, the yellow plants were 
also shorter. After 30 days, most wild-type plants produced flowers and siliques, while the 
7 mutants just began flowering. After 45 days, control plants were harvested while mutant 
plants were harvested 10 to 15 days later. No such effects were observed in 150 plantlets 
from EMS-treated plants. 
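
The counts reported above (7 of 118 offspring of DMA-treated plants versus 0 of 150 control offspring) can be put in perspective with a one-sided Fisher exact test. No such test is reported in this specification; the sketch below is offered only as an illustrative way to gauge whether a difference of this size is plausible by chance, and the function name is hypothetical.

```python
from math import comb


def fisher_one_sided(a: int, n1: int, c: int, n2: int) -> float:
    """One-sided Fisher exact p-value.

    a of n1 plants are affected in the treated group and c of n2 in the
    comparison group. Returns the probability of seeing at least a affected
    plants in the treated group if the affected plants were distributed at random.
    """
    affected = a + c
    denominator = comb(n1 + n2, affected)
    upper = min(affected, n1)
    numerator = sum(
        comb(n1, k) * comb(n2, affected - k) for k in range(a, upper + 1)
    )
    return numerator / denominator


if __name__ == "__main__":
    frequency = 7 / 118
    p_value = fisher_one_sided(7, 118, 0, 150)
    print(f"altered-plant frequency in DMA offspring: {frequency:.3f}")
    print(f"one-sided Fisher exact p-value vs control: {p_value:.4f}")
```

The test treats each offspring plant as independent, which is an assumption: the seeds came from a limited number of treated parent plants, so siblings may share the same inherited alteration.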

The effect of DMA on MMR was confirmed by monitoring the structure of 
endogenous polynucleotide repeat markers within the plant genome. DNA was extracted 
using the DNAzol method following the manufacturer's protocol (Life Technology). 
Briefly, two leaves were harvested from DMA, EMS or untreated plants and DNA was 
extracted. DNAs were quantified by optical density using a BioRad Spectrophotometer. 
In Arabidopsis, a series of poly-A (A)n, (CA)n and (GA)n markers were found as a result of 
EMBL and GenBank database searches of DNA sequence data generated as a result of the 
Arabidopsis genome-sequencing project. Two markers that are naturally occurring, 
ATHACS and Nga128, are used to monitor microsatellite stability using primers described 
(Bell, C.J. and J.R. Ecker (1994) Genomics 19:137-144). ATHACS has a stretch of thirty-six 
adenine repeats (A)36, whereas Nga128 is characterized by a di-nucleotide AG repeat 
that is repeated nineteen times (AG)19, while the Nga280 marker contains a polyAG repeat 
marker with 15 dinucleotides. DMA-mediated alterations of these markers are measured 
by a PCR assay. Briefly, the genomic DNA is amplified with specific primers in PCR 
reaction buffers described above using 1-10 ng plant genomic DNA. Primers for each 
marker are listed below: 



nga280: 
nga280-F: 5'-CTGATCTCACGGACAATAGTGC-3' (SEQ ID NO:5) 
nga280-R: 5'-GGCTCCATAAAAAGTGCACC-3' (SEQ ID NO:6) 

nga128: 
nga128-F: 5'-GGTCTGTTGATGTCGTAAGTCG-3' (SEQ ID NO:7) 
nga128-R: 5'-ATCTTGAAACCTTTAGGGAGGG-3' (SEQ ID NO:8) 

ATHACS: 
ATHACS-F: 5'-AGAAGTTTAGACAGGTAC-3' (SEQ ID NO:9) 
ATHACS-R: 5'-AAATGTGCAATTGCCTTC-3' (SEQ ID NO:10) 

Cycling conditions are 94°C for 15 seconds, 55°C for 15 seconds and 72°C for 30 
seconds, conditions that have been demonstrated to efficiently amplify these two markers 
(personal observation, Morphotek). PCR products are analyzed on 3.5% MetaPhor agarose 
gel in Tris-Acetate-EDTA buffer following staining with ethidium bromide. 
Another method used to demonstrate the biochemical activity of a plant host's 
MMR is the use of reporter genes disrupted by a polynucleotide repeat, similar to 
that described in Example 1 and Fig. 1. Due to the high endogenous β-galactosidase 
background, we engineered a plant-compatible MMR-sensitive reporter gene consisting of 
the β-glucuronidase (GUS) gene with a mononucleotide repeat that was inserted just 
downstream of the initiation codon. Two reporter constructs were generated. The first, 
pGUS-OF, contained a 20 base adenine repeat inserted just downstream of the initiating 
methionine that resulted in a frameshift, therefore producing a nonfunctional enzyme. The 
second, pGUS-IF, contained a 19 base adenine repeat that retained an open reading frame and 
served as a control for β-glucuronidase activity. Both constructs were generated by PCR 
using the pBI-121 vector (Life Technologies) as template. The antisense primer was 
directed to the 3' end of the Nopaline Synthase (NOS) polytermination sequence contained 
within the pBI-121 plasmid and contained a unique EcoRI restriction site to facilitate 
cloning of the vector into the pBI-121 binary vector backbone. The sense primers 
contained a unique BamHI restriction site to facilitate cloning of the chimeric GUS 
reporter gene into the pBI-121 binary vector backbone. The primers used to generate each 
reporter are: 

1. sense primer for pGUS-IF (uidA-ATG-polyA-IF): 

5'- CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA CGT CCT GTA GAA ACC-3' (SEQ 
ID NO:11) 

2. sense primer for pGUS-OF (uidA-ATG-polyA-OF): 

5'- CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA ACG TCC TGT AGA AAC C-3' 
(SEQ ID NO:12) 

3. antisense primer (Nos-term): 

5'- CCC GAA TTC CCC GAT CTA GTA ACA TAG ATG-3' (SEQ ID NO:13) 
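
The frame relationship between the two sense primers can be checked directly from their sequences. The sketch below measures the adenine run in each primer and tests whether the GUS-derived 3' end sits a multiple of three bases downstream of the ATG. The names `GUS_ANCHOR` and `analyse` are hypothetical, and treating the shared 3' bases CGTCCTGTAGAAACC as GUS coding sequence beginning at a codon boundary is an assumption made for illustration.

```python
import re

# Sense primers as listed above (SEQ ID NO:11 and SEQ ID NO:12).
PRIMERS = {
    "pGUS-IF": "CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA CGT CCT GTA GAA ACC",
    "pGUS-OF": "CCC GGA TCC ATG TTA AAA AAA AAA AAA AAA AAA ACG TCC TGT AGA AAC C",
}

# Assumption: these 15 bases, shared by both primers, are GUS coding sequence
# that must start at a codon boundary for the enzyme to be translated in frame.
GUS_ANCHOR = "CGTCCTGTAGAAACC"


def analyse(primer: str) -> tuple[int, bool]:
    """Return (longest A-run length, whether the GUS anchor is in frame with ATG)."""
    seq = primer.replace(" ", "").upper()
    start = seq.index("ATG")
    longest_a_run = max(len(run) for run in re.findall(r"A+", seq))
    offset = seq.index(GUS_ANCHOR) - start
    return longest_a_run, offset % 3 == 0


if __name__ == "__main__":
    for name, primer in PRIMERS.items():
        run, in_frame = analyse(primer)
        print(f"{name}: A-run = {run}, GUS in frame = {in_frame}")
```

Run on the two primers, the check reports a 19-base adenine run that keeps the anchor in frame for pGUS-IF and a 20-base run that shifts it by one base for pGUS-OF, matching the description of the two constructs.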



PCR amplifications were carried out using reaction buffers described above. 
Reactions were performed using 1 ng of pBI-121 vector as template (Life Technologies) 
and the appropriate corresponding primers. Amplifications were carried out at 94°C for 30 
seconds, 54°C for 60 seconds and 72°C for 60 seconds for 25 cycles. PCR products of the 
expected molecular weight were gel purified, cloned into T-tailed vectors (InVitrogen), and 
sequenced to ensure authentic sequence using the following primers: CaMV-FORW. [= 5'- 
gat atc tcc act gac gta ag-3'] (SEQ ID NO:30) for sequencing from the CaMV promoter 
into the 5' end of GUS cDNAs; NOSpA-42F [= 5'-tgt tgc cgg tct tgc gat g-3'] (SEQ ID 
NO:31) for sequencing of the NOS terminator; NOSpA-Cend-R [= 5'-ccc gat cta gta aca 
tag atg-3'] (SEQ ID NO:32) for sequencing from the NOS terminator into the 3' end of the 
GUS cDNAs; GUS-63F [= 5'-cag tct gga tcg cga aaa ctg-3'] (SEQ ID NO:33), GUS-441F 
[= 5'-ggt gat tac cga cga aaa cg-3'] (SEQ ID NO:34), GUS-825F [= 5'-agt gaa ggg cga aca 
gtt cc-3'] (SEQ ID NO:35), GUS-1224F [= 5'-gag tat tgc caa cga acc-3'] (SEQ ID NO:36), 
GUS-1596F [= 5'-gta tca ccg cgt ctt tga tc-3'] (SEQ ID NO:37), GUS-265R [= 5'-cga aac 
gca gca cga tac g-3'] (SEQ ID NO:38), GUS-646R [= 5'-gtt caa cgc tga cat cac c-3'] (SEQ 
ID NO:39), GUS-1033R [= 5'-cat gtt cat ctg ccc agt cg-3'] (SEQ ID NO:40), GUS-1425R 
[= 5'-gct ttg gac ata cca tac-3'] (SEQ ID NO:41), and GUS-1783R [= 5'-cac cga agt tca tgc 
cag-3'] (SEQ ID NO:42) for the sequence of the full length GUS cDNAs. No mutations 
were found in either the OF or IF version of the GUS cDNA and the expected frames for 
both cDNAs were also confirmed. pCR-IF-GUS and pCR-OF-GUS plasmids were 
subsequently digested with the BamH I and EcoR I restriction endonucleases to generate 
DNA fragments containing the GUS cDNA along with the NOS terminator. These 
fragments were ligated into the BamH I and the EcoR I sites of the pBI-121 plasmid, 
which was prepared for cloning by cutting it with the same enzymes to release the wild 
type GUS cDNA. The resulting constructs (pBI-IF-GUS and pBI-OF-GUS) were 
subsequently digested with Hind III and EcoR I to release the DNA fragments 
encompassing the CaMV promoter, the IF or OF GUS cDNA, and the NOS terminator. 
Finally, these fragments were ligated into the corresponding restriction sites present in the 
pGPTV-HPT binary vector (ATCC) to obtain the pCMV-IF-GUS-HPT and pCMV-OF-GUS-HPT 
binary vectors. 

The resulting vectors, CMV-OF-GUS-HPT and CMV-IF-GUS-HPT, now contain 
the CaMV 35S promoter from the Cauliflower Mosaic Virus driving the GUS gene, 
followed by a NOS terminator and polyadenylation signal (Fig. 7). In addition, these vectors 
also contain a hygromycin resistance gene as a selectable marker that is used to select for 
plants containing this reporter. 

Generation of GUS reporter-expressing Arabidopsis thaliana transgenic plants. 

Agrobacterium tumefaciens bacteria are used to shuttle binary expression vectors 
into plants. To generate β-glucuronidase-expressing Arabidopsis thaliana (A. thaliana) 
plants, Agrobacterium tumefaciens cells (strain GV3101) were electroporated with the 
CMV-OF-GUS-HPT or the CMV-IF-GUS-HPT binary vector using methods known by 
those skilled in the art. Briefly, one-month old A. thaliana (ecotype Columbia) plants 
were infected by immersion in a solution containing 5% sucrose, 0.05% silwet and binary 
vector-transformed Agrobacteria cells for 10 seconds. These plants were then grown at 
25°C under a 16 hour light and 8 hour dark photoperiod. After 4 weeks, seeds (referred to as 
T1) were harvested and dried for 5 days. Thirty thousand seeds from ten CMV-OF-GUS-HPT 
or CMV-IF-GUS-HPT-transformed plants were sown in solid Murashige and Skoog 
(MS) media plates in the presence of 20 µg/ml of hygromycin (HYG). Three hundred 
plants were found to be HYG resistant and represented GUS expressing plants. These 
plants along with 300 control plants were grown in MS media for two weeks and then 
transferred to soil. Plants were grown for an additional four weeks under standard 
conditions at which time T2 seeds were harvested. 

To confirm the integration and stability of the GUS vector in the plant genome, 
gene segregation and PCR analyses were conducted. Commonly, three out of four T1 
plants transformed by Agrobacteria technology are expected to carry the vector inserted 
within a single locus and are therefore considered heterozygous for the integrated gene. 
Approximately 75% of the seeds (T2) generated from most of the T1 plants were found to be 
HYG-resistant, and this is in accordance with the expected 1:2:1 ratio of null (no GUS 
containing plants), heterozygous, and homozygous plants, respectively, in self-pollinating 
conditions. To confirm that these plants contained the GUS expression vector, genomic 
DNA was isolated from leaves of T1 plants using the DNAzol-mediated technique as 
described above. One ng of genomic DNA was analyzed by polymerase chain reaction 
(PCR) to confirm the presence of the GUS vector. PCR was carried out for 25 cycles with 
the following parameters: 95°C for 30 seconds; 54°C for 1 minute; and 72°C for 2 minutes 
using primers listed above. Positive reactions were observed in DNA from CMV-OF-GUS-HPT 
and CMV-IF-GUS-HPT-transformed plants and not from control (uninfected) 
plants. 
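
The segregation argument above follows from the 1:2:1 ratio: with a dominant resistance marker at a single locus, only the null quarter of selfed progeny is sensitive, so about 75% are resistant. The sketch below computes that expectation and a chi-square goodness-of-fit statistic against a 3:1 ratio. The seed counts in the example are hypothetical, since no per-plant counts are given in this specification, and the function names are illustrative.

```python
def resistant_fraction(loci: int = 1) -> float:
    """Expected resistant fraction among selfed progeny of a plant carrying a
    dominant marker in single copy at each of `loci` unlinked loci."""
    return 1.0 - 0.25 ** loci


def chi_square_3_to_1(resistant: int, sensitive: int) -> float:
    """Chi-square statistic (1 degree of freedom) for fit to a 3:1 ratio."""
    total = resistant + sensitive
    expected_r = 0.75 * total
    expected_s = 0.25 * total
    return ((resistant - expected_r) ** 2 / expected_r
            + (sensitive - expected_s) ** 2 / expected_s)


if __name__ == "__main__":
    print(f"single locus: {resistant_fraction(1):.4f} resistant expected")
    print(f"two unlinked loci: {resistant_fraction(2):.4f} resistant expected")
    # Hypothetical T2 seed counts for one T1 plant (not data from this document).
    statistic = chi_square_3_to_1(resistant=148, sensitive=52)
    # 3.841 is the 5% critical value for chi-square with 1 degree of freedom.
    print(f"chi-square = {statistic:.3f}; consistent with 3:1: {statistic < 3.841}")
```

A resistant fraction well above 75% in a given line would point toward insertions at more than one locus, which is why the single-locus and two-locus expectations are printed side by side.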
In order to assess the expression of the GUS in T1 plants, leaf tissue was collected 
from T1 plants, homogenized in liquid nitrogen using glass pestles, and suspended in RLT 
lysing buffer (Qiagen, RNeasy plant RNA extraction kit). Five micrograms of total RNA 
was purified according to the manufacturer's suggested protocol and then loaded onto a 
1.2% agarose gel (1x MOPS buffer, 3% formaldehyde), size-fractionated by 
electrophoresis, and transferred onto Hybond-N+ membrane (Amersham). Each membrane 
was incubated at 55°C in 10 ml of hybridization solution (North2South labeling kit, Pierce) 
containing 100 ng of GUS, tubulin, or HYG probes, which were generated by PCR 
amplification, according to the manufacturer's directions. Membranes were washed three 
times in 2x SSC, 0.1% SDS at 55°C, and three times in 2x SSC at ambient temperature. 
Detection was carried out using enhanced chemiluminescence (ECL). GUS message was 
detected in three out of ten analyzed transgenic lines, while no signal was found in the 
control plants. Collectively these studies demonstrated the generation of GUS expressing 
transgenic A. thaliana plants. 
To determine the status of MMR activity in host plants, one can measure the 
production of functional β-glucuronidase by staining plant leaves or roots in situ for 
β-glucuronidase activity. Briefly, plant tissue is washed twice with water and fixed in 4 ml 
of 0.02% glutaraldehyde for 15 minutes. Next, tissue is rinsed with water and incubated in 
X-glu solution [0.1 M NaPO4, 2.5 mM K3Fe(CN)6, 2.5 mM K4Fe(CN)6, 1.5 mM MgCl2, and 1 
mg/ml X-GLU (5-bromo-4-chloro-3-indoyl-β-D-glucuronide sodium salt) (Gold 
Biotechnology)] for 6 hours at 37°C. Tissues are then washed twice in phosphate buffered 
saline (PBS) solution, once in 70% ethanol and incubated for 4 hours in methanol:acetone 
(3:1) for 8 hours to remove chlorophyll. Tissues are then washed twice in PBS and stored 
in PBS with 50% glycerol. Plant tissue with functional GUS activity will stain blue. 
The presence of GUS activity in CMV-IF-GUS-HPT plants indicates that the in-frame 
N-terminus insertion of the polyA repeat does not disrupt the GUS protein function. 
The CMV-OF-GUS-HPT plants treated with DMA, EMS or untreated are tested to 
determine if these plants produce GUS activity. The presence of GUS activity in 
DMA-treated plants indicates that the polyA repeat was altered, therefore resulting in a 
frame-restoring mutation. Agents such as EMS, which are known to damage DNA by 
alkylation, cannot affect the stability of a polynucleotide repeat. These data indicate that 
plants are defective for MMR, the only process known to be responsible for MI. 
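
The idea of a frame-restoring mutation can be made concrete with simple modular arithmetic. Taking the 19-base repeat of pGUS-IF as in frame and the 20-base repeat of pGUS-OF as out of frame, as described earlier in this example, the sketch below lists which repeat-length changes would bring the OF reporter back into frame. The constant and function names are illustrative only.

```python
IN_FRAME_REPEAT = 19      # adenine repeat length of pGUS-IF, in frame per the text
OUT_OF_FRAME_REPEAT = 20  # adenine repeat length of pGUS-OF


def restores_frame(delta: int) -> bool:
    """True if changing the OF repeat by `delta` bases restores the GUS frame."""
    return (OUT_OF_FRAME_REPEAT + delta - IN_FRAME_REPEAT) % 3 == 0


if __name__ == "__main__":
    # Small slippage events of up to four bases in either direction.
    restoring = [delta for delta in range(-4, 5) if restores_frame(delta)]
    print("repeat-length changes that restore GUS activity:", restoring)
```

Among small slippage events, a loss of one base or a gain of two bases restores the frame, so blue staining in OF plants reports only a subset of the repeat alterations that actually occur.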
These data demonstrate the utility and power of using a chemical inhibitor of 
MMR to generate a high degree of genetic alteration that is not achievable by means of 
standard DNA damaging drugs. Moreover, this application teaches the use of reporter 
genes such as GUS-OF in plants to monitor the MMR activity of a plant host. 

EXAMPLE 7: Use of chemical MMR inhibitors yields microsatellite instability in 
microbes. 

To demonstrate the ability of chemical inhibitors to block MMR in a wide range of 
hosts, we employed Pichia yeast containing a pGUS-OF reporter system similar 
to that described in Example 6. Briefly, the GUS-OF and GUS-IF genes, which contain a 
polyA repeat at the N-terminus of the protein, were subcloned from the pCR-IF-GUS and 
pCR-OF-GUS plasmids into the EcoRI site of the pGP vector, which is a constitutively 
expressed yeast vector containing a zeocin resistance gene as selectable marker. pGP-GUS-IF 
and pGP-GUS-OF vectors were electroporated into competent Pichia cells using 
standard methods known by those skilled in the art. Cells were plated on YPD agar (10 g/L 
yeast extract; 20 g/L peptone; 2% glucose; 1.5% bactoagar) plates containing 100 µg/ml 
zeocin. Recombinant yeast were then analyzed for GUS expression/function by replica 
plating on YPD agar plates containing 100 µg/ml zeocin plus 1 mg/ml X-glu (5-bromo-4-chloro-3-indoyl-beta-D-glucuronide 
sodium salt) and grown at 30°C for 16 hours. One 
hundred percent of yeast expressing GUS-IF were found to turn blue in the presence of the 
X-glu substrate while none of the control yeast turned blue. None of the yeast containing 
the GUS-OF turned blue in the presence of the X-glu substrate under normal growth 
conditions. 

To demonstrate the ability of chemicals to block MMR in yeast, GUS-OF and 
control cells were incubated with 300 µM DMA, EMS, or no chemical for 48 hours. After 
incubation, yeast were plated on YPD-ZEO-X-GLU plates and grown at 30°C for 16 hours. 
After incubation, a subset of DMA-treated yeast expressing GUS-OF contained blue 
subclones, while none were seen in EMS-treated or control cells. These data demonstrate the 
ability of chemicals to block MMR of microbes in vivo to produce subclones with new output traits. 



WO 02/054856 






PCT/US01/00934 



EXAMPLE 8: Classes of other chemicals capable of blocking MMR in vivo 

The discovery of anthracene compounds presents a new method for blocking MMR 
activity of host organisms in vivo. While 9,10-dimethylanthracene (DMA) was found to 
block MMR in cell hosts, other analogs with a similar chemical composition from this 
class are also claimed in this invention. These include anthracene and related analogs such 
as 9,10-diphenylanthracene and 9,10-di-m-tolylanthracene. Myers et al. ((1988) Biochem. 
Biophys. Res. Commun. 151:1441-1445) disclosed that at high concentrations, DMA acts 
as a potent weak mutagen, while metabolized forms of DMA are the "active" ingredients 
in promoting mutation. This finding suggests that metabolites of anthracene-based 
compounds may also act as active inhibitors of MMR in vivo. For instance, studies of the 
metabolism of anthracene and 9,10-dimethylanthracene by Micrococcus sp., Pseudomonas sp. 
and Bacillus macerans microbes have found that a number of anthracene and 
9,10-dimethylanthracene metabolites are formed. These include anthracene and 
9,10-dimethylanthracene cis-dihydrodiols, hydroxy-methyl-derivatives and various phenolic 
compounds. Bacteria metabolize hydrocarbons using the dioxygenase enzyme system, 
which differs from the mammalian cytochrome P-450 monoxygenase. These findings 
suggest the use of bacteria for biotransforming anthracene and DMA for additional MMR 
blocking compounds (Traczewska, T.M. et al. (1991) Acta. Microbiol. Pol. 40:235-241). 
Metabolism studies of DMA by rat-liver microsomal preparations have found that this 
molecule is converted to 9-hydroxymethyl-10-methylanthracene (9-OHMeMA) and 
9,10-dihydroxymethyl-anthracene (9,10-DiOHMeA) (Lamparczyk, H.S. et al. (1984) 
Carcinogenesis 5:1405-1410). In addition, the trans-1,2-dihydro-1,2-dihydroxy derivative 
of DMA (DMA 1,2-diol) was found to be a major metabolite as determined by 
chromatographic, ultraviolet (UV), nuclear magnetic resonance (NMR), and mass spectral 
properties. DMA 1,2-diol was also created through the oxidation of DMA in an ascorbic 
acid-ferrous sulfate-EDTA system. Other dihydrodiols that are formed from DMA by 
metabolism are the trans-1,2- and 3,4-dihydrodiols of 9-OHMeMA (9-OHMeMA 1,2-diol 
and 9-OHMeMA 3,4-diol), while the further metabolism of DMA 1,2-diol can yield both of 
these dihydrodiols. Finally, when 9-OHMeMA is further metabolized, two main 
metabolites are formed; one was identified as 9,10-DiOHMeA and the other appeared to be 
9-OHMeMA 3,4-diol. 
The metabolism of 9-methylanthracene (9-MA), 9-hydroxymethylanthracene (9-OHMA), 
and 9,10-dimethylanthracene (9,10-DMA) by fungus also has been reported 
(Cerniglia, C.E. et al. (1990) Appl. Environ. Microbiol. 56:661-668). These compounds 
are also useful for generating DMA derivatives capable of blocking MMR. Compounds 
9-MA and 9,10-DMA are metabolized by two pathways, one involving initial hydroxylation 
of the methyl group(s) and the other involving epoxidation of the 1,2- and 3,4- aromatic 
double bond positions, followed by enzymatic hydration to form hydroxymethyl 
trans-dihydrodiols. For 9-MA metabolism, the major metabolites identified are 
trans-1,2-dihydro-1,2-dihydroxy and trans-3,4-dihydro-3,4-dihydroxy derivatives of 9-MA and 
9-OHMA, whereby 9-OHMA can be further metabolized to trans-1,2- and 3,4-dihydrodiol 
derivatives. Circular dichroism spectral analysis revealed that the major enantiomer for 
each dihydrodiol was predominantly in the S,S configuration, in contrast to the 
predominantly R,R configuration of the trans-dihydrodiol formed by mammalian enzyme 
systems. These results indicate that Cunninghamella elegans metabolizes methylated 
anthracenes in a highly stereoselective manner that is different from that reported for rat 
liver microsomes. 

The analogs listed above are examples of, but do not limit, the anthracene-derived 
compounds capable of eliciting MMR blockade. Additional analogs that are of 
potential use for blocking MMR are shown in Fig. 8. 



Other classes of small molecular weight compounds that are capable of blocking 
MMR in vivo. 

MMR is a multi-step process that involves the formation of protein complexes that 
detect mismatched bases or altered repetitive sequences and interface these mutations with 
enzymes that degrade the mutant base and repair the DNA with correct nucleotides. First, 
mismatched DNA is recognized by the mutS heterodimeric complex consisting of MSH2 
and GTBP proteins. The DNA bound mutS complex is then recognized by the mutL 
heterodimeric complex that consists of PMS2 and MLH1 proteins. The mutL complex is 
thought to interface exonucleases with the mismatched DNA site, thus initiating this 
specialized DNA repair process. After the mismatched bases are removed, the DNA is 
repaired with a polymerase. 
There are several steps in the normal process that can be targeted by small 
molecular weight compounds to block MMR. This application teaches these steps and 
the types of compounds that may be used to block this process. 



ATPase inhibitors: 

The finding that nonhydrolyzable forms of ATP are able to suppress MMR in vitro 
also suggests that the use of this type of compound can lead to blockade of MMR in vivo 
and mutation of a host organism's genome (Galio, L. et al. (1999) Nucl. Acids Res. 27:2325-2331; 
Allen, D.J. et al. (1997) EMBO J. 16:4467-4476; Bjornson, K.P. et al. (2000) 
Biochem. 39:3176-3183). One can use a variety of screening methods described within 
this application to identify ATP analogs that block the ATP-dependent steps of mismatch 
repair in vivo. 



Nuclease inhibitors: 

The removal of mismatched bases is a required step for effective MMR (Harfe, 
B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399). This suggests that 
compounds capable of blocking this step can lead to blockade of MMR in vivo and 
mutation of a host organism's genome. One can use a variety of screening methods described 
within this application to identify nuclease inhibitor analogs that block the nuclease steps 
of mismatch repair in vivo. Examples of nuclease inhibitors include, but are not 
limited to, analogs of N-ethylmaleimide, an endonuclease inhibitor (Huang, Y.C. et al. 
(1995) Arch. Biochem. Biophys. 316:485), heterodimeric adenine-chain-acridine 
compounds, exonuclease III inhibitors (Belmont, P. et al., Bioorg. Med. Chem. Lett. (2000) 
10:293-295), as well as antibiotic compounds such as heliquinomycin, which have 
helicase inhibitory activity (Chino, M. et al., J. Antibiot. (Tokyo) (1998) 51:480-486). 



Polymerase inhibitors: 

Short and long patch repair is a required step for effective MMR (Modrich, P. 
(1994) Science 266:1959-1960). This suggests that compounds capable of blocking 
MMR-associated polymerization can lead to blockade of MMR in vivo and mutation of a host 
organism's genome. One can use a variety of screening methods described within this 
application to identify polymerase inhibitor analogs that block the polymerization steps of 
mismatch repair in vivo. Examples of DNA polymerase inhibitors that are useful in 
blocking MMR activity include, but are not limited to, analogs of actinomycin D (Martin, 
S.J. et al. (1990) J. Immunol. 145:1859), aphidicolin (Kuwakado, K. et al. (1993) 
Biochem. Pharmacol. 46:1909), 1-(2'-deoxy-2'-fluoro-beta-L-arabinofuranosyl)-5-methyluracil 
(L-FMAU) (Kukhanova, M. et al., Biochem. Pharmacol. (1998) 55:1181-1187), 
and 2',3'-dideoxyribonucleoside 5'-triphosphates (ddNTPs) (Ono, K. et al., Biomed. 
Pharmacother. (1984) 38:382-389). 



Chemical Inhibitors of Mismatch Repair Gene Expression 

MMR is a multi-protein process that requires the cooperation of several proteins 
such as but not limited to mutS homologs, MSH2, MSH3, MSH6, GTBP; mutL homologs 
PMS1, PMS2, MLH1; and exonucleases and helicases such as MutH and MutY (Harfe, 
B.D. and S. Jinks-Robertson (2000) Ann. Rev. Genet. 34:359-399; Modrich, P. (1994) 
Science 266:1959-1960). Chemicals capable of blocking the expression of these genes can 
lead to the blockade of MMR. An example of a chemical that is capable of blocking 
MMR gene expression is an oligodeoxynucleotide that can specifically bind and degrade 
an MMR gene message and block protein production, as described by Chauhan, D.P. et al. (Clin. 
Cancer Res. (2000) 6:3827-3831). One can use a variety of screening methods described 
within this application to identify inhibitors that block the expression and/or function of 
MMR genes in vivo. 

DISCUSSION 

The results described herein demonstrate the use of chemicals that can block 
mismatch repair of host organisms in vivo to produce genetic mutations. The results also 
demonstrate the use of reporter systems in host cells and organisms that are useful for 
screening chemicals capable of blocking MMR of the host organism. Moreover, the 
results demonstrate the use of chemical inhibitors to block MMR in mammalian cells, 
microbes, and plants to produce organisms with new output traits. The data presented 
herein provide novel approaches for producing genetically altered plants, microbes, and 
mammalian cells with output traits for commercial applications by inhibiting MMR with 
chemicals. This approach gives advantages over others that require the use of recombinant 
techniques to block MMR or to produce new output traits by expression of a foreign gene. 
This method will be useful in producing genetically altered host organisms for 
agricultural, chemical manufacturing, pharmaceutical, and environmental applications. 

PMS2 (mouse) (SEQ ID NO:14) 

MEQTEGVSTE CAKAIKPIDG KSVHQICSGQ VILSLSTAVK ELIENSVDAG ATTIDLRLKD 60 
YGVDLIEVSD NGCGVEEENF EGLALKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 
TISTCHGSAS VGTRLVFDHN GKITQKTPYP RPKGTTVSVQ HLFYTLPVRY KEFQRNIKKE 180 
YSKMVQVLQA YCIISAGVRV SCTNQLGQGK RHAVVCTSGT SGMKENIGSV FGQKQLQSLI 240 
PFVQLPPSDA VCEEYGLSTS GRHKTFSTFR ASFHSARTAP GGVQQTGSFS SSIRGPVTQQ 300 
RSLSLSMRFY HMYNRHQYPF VVLNVSVDSE CVDINVTPDK RQILLQEEKL LLAVLKTSLI 360 
GMFDSDANKL NVNQQPLLDV EGNLVKLHTA ELEKPVPGKQ DNSPSLKSTA DEKRVASISR 420 
LREAFSLHPT KEIKSRGPET AELTRSFPSE KRGVLSSYPS DVISYRGLRG SQDKLVSPTD 480 
SPGDCMDREK IEKDSGLSST SAGSEEEFST PEVASSFSSD YNVSSLEDRP SQETINCGDL 540 
DCRPPGTGQS LKPEDHGYQC KALPLARLSP TNAKRFKTEE RPSNVNISQR LPGPQSTSAA 600 
EVDVAIKMNK RIVLLEFSLS SLAKRMKQLQ HLKAQNKHEL SYRKFRAKIC PGENQAAEDE 660 
LRKEISKSMF AEMEILGQFN LGFIVTKLKE DLFLVDQHAA DEKYNFEMLQ QHTVLQAQRL 720 
ITPQTLNLTA VNEAVLIENL EIFRKNGFDF VIDEDAPVTE RAKLISLPTS KNWTFGPQDI 780 
DELIFMLSDS PGVMCRPSRV RQMFASRACR KSVMIGTALN ASEMKKLITH MGEMDHPWNC 840 
PHGRPTMRHV ANLDVISQN 859 
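
A quick consistency check between the protein listing above and the mouse PMS2 cDNA of SEQ ID NO:15 is to translate the start of the open reading frame with the standard genetic code. The sketch below does this for the first 36 coding bases, copied from the cDNA beginning at its ATG; the helper names are hypothetical, and the codon table is built from the standard compact TCAG ordering.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring spaces and any trailing partial codon."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First 36 coding bases of the mouse PMS2 cDNA, starting at the ATG.
    orf_start = "atggagcaaaccgaaggcgtgagtacagaatgtgct"
    print(translate(orf_start))
```

The translated peptide can be compared by eye with the first twelve residues of SEQ ID NO:14; the same function could be applied to longer stretches of the cDNA to spot transcription errors in either listing.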



20 



PMS2 (mouse cDNA) (SEQ ID NO:15) 



60 



gaattccggt gaaggtcctg aagaatttcc agattcctga gtatcattgg aggagacaga 
taacctgtcg tcaggtaacg atggtgtata tgcaacagaa atgggtgttc ctggagacgc 120 
25 gtcttttccc gagagcggca ccgcaactct cccgcggtga ctgtgactgg aggagtcctg 180 
catccatgga gcaaaccgaa ggcgtgagta cagaatgtgc taaggccatc aagcctattg 240 
atgggaagtc agtccatcaa atttgttctg ggcaggtgat actcagttta agcaccgctg 300 
tgaaggagtt gatagaaaat agtgtagatg ctggtgctac tactattgat ctaaggctta 360 
aagactatgg ggtggacctc attgaagttt cagacaatgg atgtggggta gaagaagaaa 420 
30 actttgaagg tctagctctg aaacatcaca catctaagat tcaagagttt gccgacctca 480 
cgcaggttga aactttcggc tttcgggggg aagctctgag ctctctgtgt gcactaagtg 540 
atgtcactat atctacctgc caogggtctg caagcgttgg gactcgactg gtgtttgacc 600 
ataatgggaa aatcacccag aaaactccct acccccgacc taaaggaacc acagtcagtg 660 
tgcagcactt attttataca ctacccgtgc gttacaaaga gtttcagagg aacattaaaa 720 
aggagtattc caaaatggtg caggtcttac aggcgtactg tatcatctca gcaggcgtcc 780 
gtgtaagctg cactaatcag ctcggacagg ggaagcggca cgctgtggtg tgcacaagcg 840 
gcacgtctgg catgaaggaa aatatcgggt ctgtgtttgg ccagaagcag ttgcaaagcc 900 
tcattccttt tgttcagctg ccccctagtg acgctgtgtg tgaagagtac ggcctgagca 960 
cttcaggacg ccacaaaacc ttttctacgt ttcgggcttc atttcacagt gcacgcacgg 1020 
cgccgggagg agtgcaacag acaggcagtt tttcttcatc aatcagaggc cctgtgaccc 1080 
agcaaaggtc tctaagcttg tcaatgaggt tttatcacat gtataaccgg catcagtacc 1140 
catttgtcgt ccttaacgtt tccgttgact cagaatgtgt ggatattaat gtaactccag 1200 
ataaaaggca aattctacta caagaagaga agctattgct ggccgtttta aagacctcct 1260 
tgataggaat gtttgacagt gatgcaaaca agcttaatgt caaccagcag ccactgctag 1320 
atgttgaagg taacttagta aagctgcata ctgcagaact agaaaagcct gtgccaggaa 1380 
agcaagataa ctctccttca ctgaagagca cagcagacga gaaaagggta gcatccatct 1440 
ccaggctgag agaggccttt tctcttcatc ctactaaaga gatcaagtct aggggtccag 1500 
agactgctga actgacacgg agttttccaa gtgagaaaag gggcgtgtta tcctcttatc 1560 
cttcagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 1620 
cggacagccc tggtgactgt atggacagag agaaaataga aaaagactca gggctcagca 1680 
gcacctcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttagca 1740 
gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccata aactgtggtg 1800 
acctggactg ccgtcctcca ggtacaggac agtccttgaa gccagaagac catggatatc 1860 
aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc ttcaagacag 1920 
aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag agcacctcag 1960 
cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc gagttctctc 2040 
tgagttctct agctaagcga atgaagcagt tacagcacct aaaggcgcag aacaaacatg 2100 
aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaag 2160 
atgaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgggtcagt 2220 
ttaacctggg atttatagta accaaactga aagaggacct cttcctggtg gaccagcatg 2280 
ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcaga 2340 
ggctcatcac accccagact ctgaacttaa ctgctgtcaa tgaagctgta ctgatagaaa 2400 
atctggaaat attcagaaag aatggctttg actttgtcat tgatgaggat gctccagtca 2460 
ctgaaagggc taaattgatt tccttaccaa ctagtaaaaa ctggaccttt ggaccccaag 2520 
atatagatga actgatcttt atgttaagtg acagccctgg ggtcatgtgc cggccctcac 2580 

gagtcagaca gatgtttgct tccagagcct gtcggaagtc agtgatgatt ggaacggcgc 2640 
tcaatgcgag 8gagatgaag aagctcatca cccacatggg tgagatggac cacccctgga 2700 
ac?gc?c?ca cggcaggcca accatgaggc acgttgccaa tctggatgtc atctctcaga 2760 
actgacacac cllttgtagc atagagttta ttacagattg ttcggtttgc aaagagaagg 2820 
ttttaagtaa tctgattatc gttgtacaaa aattagcatg ctgctttaat gtactggatc 2880 

cait?alaag cag?gttaag gcaggcatga tggagtgttc ctctagctca gctacttggg 2940 
tgatccggtg ggagctcatg tgagcccagg actttgagac cactccgagc cacattcatg 3000 
agac?c!att caaggacaaa aaaaaaaag* tatttttgaa gccttttaaa aaaaaa 3056 

PMS2 (human) (SEQ ID NO:16) 

MERAESSSTE PAKAIKPIDR KSVHQICSGQ VVLSLSTAVK ELVENSLDAG ATNIDLKLKD 60 

YGVDLIEVSD NGCGVEEENF EGLTLKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 

TISTCHASAK VGTRLMFDHN GKIIQKTPYP RPRGTTVSVQ QLFSTLPVRH KEFQRNIKKE 180 

YAKMVQVLHA YCIISAGIRV SCTNQLGQGK RQPVVCTGGS PSIKENIGSV FGQKQLQSLI 240 
PFVQLPPSDS VCEEYGLSCS DALHNLFYIS GFISQCTHGV GRSSTDRQFF FINRRPCDPA 300 
KVTRLVNEVY HMYNRHQYPF VVLNISVDSE CVDINVTPDK RQILLQEEKL LLAVLKTSLI 360 
S5S S V EGNLIKMHAA DLEKPMVEKQ DQSPSLRTGE EKKDVSISRL 420 
REAFSLRHTT ENKPHSPKTP EPRRSPLGQK RGMLSSSTSG AISDKGVLRP QKEAVSSSHG 480 
SSSJ EKMGHGSTS VDSEGFSIPD TGSHCSSEYA ASSPGDRGSQ EHVDSQEKAP 540 
E^SdSFSDVD CHSNQEDTGC KFRVLPQPTN LATPNTKRFK KEEILSSSDI CQKLVNTQDM 600 

SASQVDVAVK INKKVVPLDF SMSSLAKRIK QLHHEAQQSE GEQNYRKFRA KICPGENQAA 660 
EDELRKEISK TMFAEMEIIG QFNLGFIITK LNEDIFIVDQ HATDEKYNFE MLQQHTVLQG 720 
QRLIAPQTLN LTAVNEAVLI ENLEIFRKNG FDFVIDENAP VTERAKLISL PTSKNWTFGP 780 
QDVDELIFML SDSPGVMCRP SRVKQMFASR ACRKSVMIGT ALNTSEMKKL ITHMGEMDHP 840 
WNCPHGRPTM RHIANLGVIS QN 862 


PMS2 (human cDNA) (SEQ ID NO:17) 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 
ao?cgac?ga tgtttgatL caatgggaaa attatccaga aaacccccta cccccgcccc 480 
agag?gacla Sgtcigcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540 
?t?Slagga atattalgaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600 
atcatttSg caggcatlcg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660 
cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720 
caglaacagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 780 
galgagtaSg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 840 
Itttclcaat gcacgcatgg agttggaagg agttcaacag acagacagtt "tctttatc 900 
aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ^taccacatg 960 
?ataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020 
gatatcaatg ttaclccaga taaaaggcaa attttgctac aagaggaaaa ^ttttgttg 1080 
gcagttttal agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140 
Igtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1200 
alaaagccL tggtagalla gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1260 
aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320 
aag?c?S?a gcccaaagac tcclgaacca agaaggagcc ctctaggaca ga^aggggt 1380 
atgctgtctt Itagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1440 
gaggclgtga gttccagtca cggacccagt gaccctacgg acagagcgga g^^g 1500 
gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1560 
lg?cal?gL gcagcgagta tgcggccagc tccccagggg acaggggctc V«OT»»t 1620 
glggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt fgactgccat 1680 
?claaccagg algataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1740 
accccaaaca caLgcgt?t taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800 
aagttagtaa atactclgga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1860 
aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1920 
calcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980 
tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040 
tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100 
gaggatatct tcatagtgga ccagcatgcc acggacgaga agtataactt cgagatgctg 2160 
cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ctcagactct caacttaact 2220 
gctgttaatg aagctgttct gatagaaaat ctggaaatat ttagaaagaa tggctttgat 2280 
tttgttatcg atgaaaatgc tccagtcact gaaagggcta aactgatttc cttgccaact 2340 
agtaaaaact ggaccttcgg accccaggac gtcgatgaac tgatcttcat gctgagcgac 2400 
agccctgggg tcatgtgccg gccttcccga gtcaagcaga tgtttgcctc cagagcctgc 2460 
cggaagtcgg tgatgattgg gactgctctt aacacaagcg agatgaagaa actgatcacc 2520 
cacatggggg agatggacca cccctggaac tgtccccatg gaaggccaac catgagacac 2580 
atcgccaacc tgggtgtcat ttctcagaac tgaccgtagt cactgtatgg aataattggt 2640 
tttatcgcag atttttatgt tttgaaagac agagtcttca ctaacctttt ttgttttaaa 2700 
atgaaacctg ctacttaaaa aaaatacaca tcacacccat ttaaaagtga tcttgagaac 2760 
cttttcaaac c 2771 

PMS1 (human) (SEQ ID NO:18) 

MKQLPAATVR LLSSSQIITS VVSVVKELIE NSLDAGATSV DVKLENYGFD KIEVRDNGEG 60 
IKAVDAPVMA MKYYTSKINS HEDLENLTTY GFRGEALGSI CCIAEVLITT RTAADNFSTQ 120 
YVLDGSGHIL SQKPSHLGQG TTVTALRLFK NLPVRKQFYS TAKKCKDEIK KIQDLLMSFG 180 
ILKPDLRIVF VHNKAVIWQK SRVSDHKMAL MSVLGTAVMN NMESFQYHSE ESQIYLSGFL 240 
PKCDADHSFT SLSTPERSFI FINSRPVHQK DILKLIRHHY NLKCLKESTR LYPVFFLKID 300 
VPTADVDVNL TPDKSQVLLQ NKESVLIALE NLMTTCYGPL PSTNSYENNK TDVSAADIVL 360 
SKTAETDVLF NKVESSGKNY SNVDTSVIPF QNDMHNDESG KNTDDCLNHQ ISIGDFGYGH 420 
CSSEISNIDK NTKNAFQDIS MSNVSWENSQ TEYSKTCFIS SVKHTQSENG NKDHIDESGE 480 
NEEEAGLENS SEISADEWSR GNILKNSVGE NIEPVKILVP EKSLPCKVSN NNYPIPEQMN 540 
LNEDSCNKKS NVIDNKSGKV TAYDLLSNRV IKKPMSASAL FVQDHRPQFL IENPKTSLED 600 
ATLQIEELWK TLSEEEKLKY EEKATKDLER YNSQMKRAIE QESQMSLKDG RKKIKPTSAW 660 
NLAQKHKLKT SLSNQPKLDE LLQSQIEKRR SQNIKMVQIP FSMKNLKINF KKQNKVDLEE 720 
KDEPCLIHNL RFPDAWLMTS KTEVMLLNPY RVEEALLFKR LLENHKLPAE PLEKPIMLTE 780 
SLFNGSHYLD VLYKMTADDQ RYSGSTYLSD PRLTANGFKI KLIPGVSITE NYLEIEGMAN 840 
CLPFYGVADL KEILNAILNR NAKEVYECRP RKVISYLEGE AVRLSRQLPM YLSKEDIQDI 900 
IYRMKHQFGN EIKECVHGRP FFHHLTYLPE TT 932 



PMS1 (human cDNA) (SEQ ID NO:19) 

ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 60 
ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 120 
gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 180 
atgctggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 240 
tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 300 
acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 360 
gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 420 
ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 480 
cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 540 
taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 600 
atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 660 
aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 720 
tggggactgc tgttatgaac aatatggaat cctttcagta ccactctgaa gaatctcaga 780 
tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctttcaa 840 
caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 900 
agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 960 
ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1020 
aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1080 
cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1140 
ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1200 
aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1260 
tgcataatga tgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1320 
gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1380 
atgcatttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1440 
gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 1500 
atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1560 
ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1620 
ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1680 
caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag 1740 
ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1800 
ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 1860 
ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1920 
aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1980 
aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 2040 
taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2100 
atcaaccaaa acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2160 
ttaaaatggt acagatcccc ttttctatga aaaacttaaa aataaatttt aagaaacaaa 2220 
acaaagttga cttagaagag aaggatgaac cttgcttgat ccacaatctc aggtttcctg 2280 
atgcatggct aatgacatcc aaaacagagg taatgttatt aaatccatat agagtagaag 2340 
aagccctgct atttaaaaga cttcttgaga atcataaact tcctgcagag ccactggaaa 2400 
agccaattat gttaacagag agtcttttta atggatctca ttatttagac gttttatata 2460 
aaatgacagc agatgaccaa agatacagtg gatcaactta cctgtctgat cctcgtctta 2520 
cagcgaatgg tttcaagata aaattgatac caggagtttc ^attactgaa aattacttgg 2580 
aaatagaagg aatggctaat tgtctcccat tctatggagt agcagattta aaagaaattc 2640 
ttaatgctat attaaacaga aatgcaaagg aagtttatga atgtagacct cgcaaagtga 2700 
taagttattt agagggagaa gcagtgcgtc tatccagaca attacccatg tacttatcaa 2760 
aagaggacat ccaagacatt atctacagaa tgaagcacca gtttggaaat gaaattaaag 2820 
agtgtgttca tggtcgccca ttttttcatc atttaaccta tcttccagaa actacatgat 2880 
taaatatgtt taagaagatt agttaccatt gaaattggtt ctgtcataaa acagcatgag 2940 
tctggtttta aattatcttt gtattatgtg tcacatggtt attttttaaa tgaggattca 3000 
ctgacttgtt tttatattga aaaaagttcc acgtattgta gaaaacgtaa ataaactaat 3060 
aac 



MSH2 (human) (SEQ ID NO:20) 

MAVQPKETLQ LESAAEVGFV RFFQGMPEKP TTTVRLFDRG DFYTAHGEDA LLAAREVFKT 60 
QGVIKYMGPA GAKNLQSVVL SKMNFESFVK DLLLVRQYRV EVYKNRAGNK ASKENDWYLA 120 
YKASPGNLSQ FEDILFGNND MSASIGVVGV KMSAVDGQRQ VGVGYVDSIQ RKLGLCEFPD 180 
NDQFSNLEAL LIQIGPKECV LPGGETAGDM GKLRQIIQRG GILITERKKA DFSTKDIYQD 240 
LNRLLKGKKG EQMNSAVLPE MENQVAVSSL SAVIKFLELL SDDSNFGQFE LTTFDFSQYM 300 
KLDIAAVRAL NLFQGSVEDT TGSQSLAALL NKCKTPQGQR LVNQWIKQPL MDKNRIEERL 360 
NLVEAFVEDA ELRQTLQEDL LRRFPDLNRL AKKFQRQAAN LQDCYRLYQG INQLPNVIQA 420 
LEKHEGKHQK LLLAVFVTPL TDLRSDFSKF QEMIETTLDM DQVENHEFLV KPSFDPNLSE 480 
LREIMNDLEK KMQSTLISAA RDLGLDPGKQ IKLDSSAQFG YYFRVTCKEE KVLRNNKNFS 540 
TVDIQKNGVK FTNSKLTSLN EEYTKNKTEY EEAQDAIVKE IVNISSGYVE PMQTLNDVLA 600 
QLDAVVSFAH VSNGAPVPYV RPAILEKGQG RIILKASRHA CVEVQDEIAF IPNDVYFEKD 660 
KQMFHIITGP NMGGKSTYIR QTGVIVLMAQ IGCFVPCESA EVSIVDCILA RVGAGDSQLK 720 
GVSTFMAEML ETASILRSAT KDSLIIIDEL GRGTSTYDGF GLAWAISEYI ATKIGAFCMF 780 
ATHFHELTAL ANQIPTVNNL HVTALTTEET LTMLYQVKKG VCDQSFGIHV AELANFPKHV 840 
IECAKQKALE LEEFQYIGES QGYDIMEPAA KKCYLEREQG EKIIQEFLSK VKQMPFTEMS 900 
EENITIKLKQ LKAEVIAKNN SFVNEIISRI KVTT 934 



MSH2 (human cDNA) (SEQ ID NO:21) 

ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttcaaccagg aggtgaggag 60 
gtttcgacat ggcggtgcag ccgaaggaga cgctgcagtt ggagagcgcg gccgaggtcg 120 
gcttcgtgcg cttctttcag ggcatgccgg agaagccgac caccacagtg cgccttttcg 180 
accggggcga cttctatacg gcgcacggcg aggacgcgct gctggccgcc cgggaggtgt 240 
tcaagaccca gggggtgatc aagtacatgg ggccggcagg agcaaagaat ctgcagagtg 300 
ttgtgcttag taaaatgaat tttgaatctt ttgtaaaaga tcttcttctg gttcgtcagt 360 
atagagttga agtttataag aatagagctg gaaataaggc atccaaggag aatgattggt 420 
atttggcata taaggcttct cctggcaatc tctctcagtt tgaagacatt ctctttggta 480 
acaatgatat gtcagcttcc attggtgttg tgggtgttaa aatgtccgca gttgatggcc 540 
agagacaggt tggagttggg tatgtggatt ccatacagag gaaactagga ctgtgtgaat 600 
tccctgataa tgatcagttc tccaatcttg aggctctcct catccagatt ggaccaaagg 660 
aatgtgtttt acccggagga gagactgctg gagacatggg gaaactgaga cagataattc 720 
aaagaggagg aattctgatc acagaaagaa aaaaagctga cttttccaca aaagacattt 780 
atcaggacct caaccggttg ttgaaaggca aaaagggaga gcagatgaat agtgctgtat 840 
tgccagaaat ggagaatcag gttgcagttt catcactgtc tgcggtaatc aagtttttag 900 
aactcttatc agatgattcc aactttggac agtttgaact gactactttt gacttcagcc 960 
agtatatgaa attggatatt gcagcagtca gagcccttaa cctttttcag ggttctgttg 1020 
aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa acccctcaag 1080 
gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140 
agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 1200 
aagatttact tcgtcgattc ccagatctta accgacttgc caagaagttt caaagacaag 1260 
cagcaaactt acaagattgt taccgactct atcagggtat aaatcaacta cctaatgtta 1320 
tacaggctct ggaaaaacat gaaggaaaac accagaaatt attgttggca gtttttgtga 1380 
ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 1440 
tagatatgga tcaggtggaa aaccatgaat tccttgtaaa accttcattt gatcctaatc 1500 
tcagtgaatt aagagaaata atgaatgact tggaaaagaa gatgcagtca acattaataa 1560 
gtgcagccag agatcttggc ttggaccctg gcaaacagat taaactggat tccagtgcac 1620 
agtttggata ttactttcgt gtaacctgta aggaagaaaa agtccttcgt aacaataaaa 1680 
actttagtac tgtagatatc cagaagaatg gtgttaaatt taccaacagc aaattgactt 1740 
ctttaaatga agagtatacc aaaaataaaa cagaatatga agaagcccag gatgccattg 1800 
ttaaagaaat tgtcaatatt tcttcaggct atgtagaacc aatgcagaca ctcaatgatg 1860 
tgttagctca gctagatgct gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920 
catatgtacg accagccatt ttggagaaag gacaaggaag aattatatta aaagcatcca 1980 
ggcatgcttg tgttgaagtt caagatgaaa ttgcatttat tcctaatgac gtatactttg 2040 
aaaaagataa acagatgttc cacatcatta ctggccccaa tatgggaggt aaatcaacat 2100 
atattcgaca aactggggtg atagtactca tggcccaaat tgggtgtttt gtgccatgtg 2160 
agtcagcaga agtgtccatt gtggactgca tcttagcccg agtaggggct ggtgacagtc 2220 
aattgaaagg agtctccacg ttcatggctg aaatgttgga aactgcttct atcctcaggt 2280 
ctgcaaccaa agattcatta ataatcatag atgaattggg aagaggaact tctacctacg 2340 
atggatttgg gttagcatgg gctatatcag aatacattgc aacaaagatt ggtgcttttt 2400 
gcatgtttgc aacccatttt catgaactta ctgccttggc caatcagata ccaactgtta 2460 
ataatctaca tgtcacagca ctcaccactg aagagacctt aactatgctt tatcaggtga 2520 
agaaaggtgt ctgtgatcaa agttttggga ttcatgttgc agagcttgct aatttcccta 2580 
agcatgtaat agagtgtgct aaacagaaag ccctggaact tgaggagttt cagtatattg 2640 
gagaatcgca aggatatgat atcatggaac cagcagcaaa gaagtgctat ctggaaagag 2700 
agcaaggtga aaaaattatt caggagttcc tgtccaaggt gaaacaaatg ccctttactg 2760 
aaatgtcaga agaaaacatc acaataaagt taaaacagct aaaagctgaa gtaatagcaa 2820 
agaataatag ctttgtaaat gaaatcattt cacgaataaa agttactacg tgaaaaatcc 2880 
cagtaatgga atgaaggtaa tattgataag ctattgtctg taatagtttt atattgtttt 2940 
atattaaccc tttttccata gtgttaactg tcagtgccca tgggctatca acttaataag 3000 
atatttagta atattttact ttgaggacat tttcaaagat ttttattttg aaaaatgaga 3060 
gctgtaactg aggactgttt gcaattgaca taggcaataa taagtgatgt gctgaatttt 3120 
ataaataaaa tcatgtagtt tgtgg 3145 



MLH1 (human) (SEQ ID NO:22) 

MSFVAGVIRR LDETVVNRIA AGEVIQRPAN AIKEMIENCL DAKSTSIQVI VKEGGLKLIQ 60 
IQDNGTGIRK EDLDIVCERF TTSKLQSFED LASISTYGFR GEALASISHV AHVTITTKTA 120 
DGKCAYRASY SDGKLKAPPK PCAGNQGTQI TVEDLFYNIA TRRKALKNPS EEYGKILEVV 180 
GRYSVHNAGI SFSVKKQGET VADVRTLPNA STVDNIRSIF GNAVSRELIE IGCEDKTLAF 240 
KMNGYISNAN YSVKKCIFLL FINHRLVEST SLRKAIETVY AAYLPKNTHP FLYLSLEISP 300 
QNVDVNVHPT KHEVHFLHEE SILERVQQHI ESKLLGSNSS RMYFTQTLLP GLAGPSGEMV 360 
KSTTSLTSSS TSGSSDKVYA HQMVRTDSRE QKLDAFLQPL SKPLSSQPQA IVTEDKTDIS 420 
SGRARQQDEE MLELPAPAEV AAKNQSLEGD TTKGTSEMSE KRGPTSSNPR KRHREDSDVE 480 
MVEDDSRKEM TAACTPRRRI INLTSVLSLQ EEINEQGHEV LREMLHNHSF VGCVNPQWAL 540 
AQHQTKLYLL NTTKLSEELF YQILIYDFAN FGVLRLSEPA PLFDLAMLAL DSPESGWTEE 600 
DGPKEGLAEY IVEFLKKKAE MLADYFSLEI DEEGNLIGLP LLIDNYVPPL EGLPIFILRL 660 
ATEVNWDEEK ECFESLSKEC AMFYSIRKQY ISEESTLSGQ QSEVPGSIPN SWKWTVEHIV 720 
YKALRSHILP PKHFTEDGNI LQLANLPDLY KVFERC 756 

MLH1 (human cDNA) (SEQ ID NO:23) 

cttggctctt ctggcgccaa aatgtcgttc gtggcagggg ttattcggcg gctggacgag 60 
acagtggtga accgcatcgc ggcgggggaa gttatccagc ggccagctaa tgctatcaaa 120 
gagatgattg agaactgttt agatgcaaaa tccacaagta ttcaagtgat tgttaaagag 180 
ggaggcctga agttgattca gatccaagac aatggcaccg ggatcaggaa agaagatctg 240 
gatattgtat gtgaaaggtt cactactagt aaactgcagt cctttgagga tttagccagt 300 
atttctacct atggctttcg aggtgaggct ttggccagca taagccatgt ggctcatgtt 360 
actattacaa cgaaaacagc tgatggaaag tgtgcataca gagcaagtta ctcagatgga 420 
aaactgaaag cccctcctaa accatgtgct ggcaatcaag ggacccagat cacggtggag 480 
gacctttttt acaacatagc cacgaggaga aaagctttaa aaaatccaag tgaagaatat 540 
gggaaaattt tggaagttgt tggcaggtat tcagtacaca atgcaggcat tagtttctca 600 
gttaaaaaac aaggagagac agtagctgat gttaggacac tacccaatgc ctcaaccgtg 660 
gacaatattc gctccatctt tggaaatgct gttagtcgag aactgataga aattggatgt 720 
gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg 780 



WO 02/054856 



44 



PCT/US01/00934 



aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840 
aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900 
ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960 
gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020 
ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080 
ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 1140 
agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200 
gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260 
gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320 

ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380 
gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 1440 
cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct 1500 
tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560 
aatgagcagg gacatgaggt tctccgggag atgttgcata accactcctt cgtgggctgt 1620 
gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc 1680 
aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt 1740 
ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca 1800 
gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag 1860 
tttctgaaga agaaggctga gatgcttgca gactatttct ctttggaaat tgatgaggaa 1920 

gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg 1980 
cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt 2040 
gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag 2100 
gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag 2160 
tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat 2220 
ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt 2280 
gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc 2340 
cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag 2400 
clcttaagal ttatacttgo cttctgatag tattccttta tacacagtgg attgattata 2460 



aataaataga tgtgtcttaa cata 2484 

hPMS2-134 (human) (SEQ ID NO:24) 

MERAESSSTE PAKAIKPIDR KSVHQICSGQ VVLSLSTAVK ELVENSLDAG ATNIDLKLKD 60 
YGVDLIEVSD NGCGVEEENF EGLTLKHHTS KIQEFADLTQ VETFGFRGEA LSSLCALSDV 120 
TISTCHASAK VGT 133 

hPMS2-134 (human cDNA) (SEQ ID NO:25) 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60 
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 
acttga 426 






GTBP (human) (SEQ ID NO:26) 



MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60 
ARSASPPKAK NLNGGLRRSV APAAPTSCDF SPGDLVWAKM EGYPWWPCLV YNHPFDGTFI 120 
REKGKSVRVH VQFFDDSPTR GWVSKRLLKP YTGSKSKEAQ KGGHFYSAKP EILRAMQRAD 180 
EALNKDKIKR LELAVCDEPS EPEEEEEMEV GTTYVTDKSE EDNEIESEEE VQPKTQGSRR 240 
SSRQIKKRRV ISDSESDIGG SDVEFKPDTK EEGSSDEISS GVGDSESEGL NSPVKVARKR 300 
KRMVTGNGSL KRKSSRKETP SATKQATSIS SETKNTLRAF SAPQNSESQA HVSGGGDDSS 360 
RPTVWYHETL EWLKEEKRRD EHRRRPDHPD FDASTLYVPE DFLNSCTPGM RKWWQIKSQN 420 
FDLVICYKVG KFYELYHMDA LIGVSELGLV FMKGNWAHSG FPEIAFGRYS DSLVQKGYKV 480 
ARVEQTETPE MMEARCRKMA HISKYDRVVR REICRIITKG TQTYSVLEGD PSENYSKYLL 540 
SLKEKEEDSS GHTRAYGVCF VDTSLGKFFI GQFSDDRHCS RFRTLVAHYP PVQVLFEKGN 600 
LSKETKTILK SSLSCSLQEG LIPGSQFWDA SKTLRTLLEE EYFREKLSDG IGVMLPQVLK 660 
GMTSESDSIG LTPGEKSELA LSALGGCVFY LKKCLIDQEL LSMANFEEYI PLDSDTVSTT 720 
RSGAIFTKAY QRMVLDAVTL NNLEIFLNGT NGSTEGTLLE RVDTCHTPFG KRLLKQWLCA 780 
PLCNHYAIND RLDAIEDLMV VPDKISEVVE LLKKLPDLER LLSKIHNVGS PLKSQNHPDS 840 
RAIMYEETTY SKKKIIDFLS ALEGFKVMCK IIGIMEEVAD GFKSKILKQV ISLQTKNPEG 900 
RFPDLTVELN RWDTAFDHEK ARKTGLITPK AGFDSDYDQA LADIRENEQS LLEYLEKQRN 960 
RIGCRTIVYW GIGRNRYQLE IPENFTTRNL PEEYELKSTK KGCKRYWTKT IEKKLANLIN 1020 
AEERRDVSLK DCMRRLFYNF DKNYKDWQSA VECIAVLDVL LCLANYSRGG DGPMCRPVIL 1080 
LPEDTPPFLE LKGSRHPCIT KTFFGDDFIP NDILIGCEEE EQENGKAYCV LVTGPNMGGK 1140 
STLMRQAGLL AVMAQMGCYV PAEVCRLTPI DRVFTRLGAS DRIMSGESTF FVELSETASI 1200 
LMHATAHSLV LVDELGRGTA TFDGTAIANA VVKELAETIK CRTLFSTHYR SLVEDYSQNV 1260 
AVRLGHMACM VENECEDPSQ ETITFLYKFI KGACPKSYGF NAARLANLPE EVIQKGHRKA 1320 
REFEKMNQSL RLFREVCLAS ERSTVDAEAV HKLLTLIKEL 1360 



GTBP (human cDNA) (SEQ ID NO:27) 

gccgcgcggt agatgcggtg cttttaggag ctccgtccga cagaacggtt gggccttgcc 60 
ggctgtcggt atgtcgcgac agagcaccct gtacagcttc ttccccaagt ctccggcgct 120 
gagtgatgcc aacaaggcct cggccagggc ctcacgcgaa ggcggccgtg ccgccgctgc 180 
ccccggggcc tctccttccc caggcgggga tgcggcctgg agcgaggctg ggcctgggcc 240 
caggcccttg gcgcgctccg cgtcaccgcc caaggcgaag aacctcaacg gagggctgcg 300 
gagatcggta gcgcctgctg cccccaccag ttgtgacttc tcaccaggag atttggtttg 360 
ggccaagatg gagggttacc cctggtggcc ttgtctggtt tacaaccacc cctttgatgg 420 
aacattcatc cgcgagaaag ggaaatcagt ccgtgttcat gtacagtttt ttgatgacag 480 
cccaacaagg ggctgggtta gcaaaaggct tttaaagcca tatacaggtt caaaatcaaa 540 
ggaagcccag aagggaggtc atttttacag tgcaaagcct gaaatactga gagcaatgca 600 
acgtgcagat gaagccttaa ataaagacaa gattaagagg cttgaattgg cagtttgtga 660 
tgagccctca gagccagaag aggaagaaga gatggaggta ggcacaactt acgtaacaga 720 
taagagtgaa gaagataatg aaattgagag tgaagaggaa gtacagccta agacacaagg 780 
atctaggcga agtagccgcc aaataaaaaa acgaagggtc atatcagatt ctgagagtga 840 
cattggtggc tctgatgtgg aatttaagcc agacactaag gaggaaggaa gcagtgatga 900 
aataagcagt ggagtggggg atagtgagag tgaaggcctg aacagccctg tcaaagttgc 960 
tcgaaagcgg aagagaatgg tgactggaaa tggctctctt aaaaggaaaa gctctaggaa 1020 
ggaaacgccc tcagccacca aacaagcaac tagcatttca tcagaaacca agaatacttt 1080 
gagagctttc tctgcccctc aaaattctga atcccaagcc cacgttagtg gaggtggtga 1140 
tgacagtagt cgccctactg tttggtatca tgaaacttta gaatggctta aggaggaaaa 1200 
gagaagagat gagcacagga ggaggcctga tcaccccgat tttgatgcat ctacactcta 1260 
tgtgcctgag gatttcctca attcttgtac tcctgggatg aggaagtggt ggcagattaa 1320 
gtctcagaac tttgatcttg tcatctgtta caaggtgggg aaattttatg agctgtacca 1380 
catggatgct cttattggag tcagtgaact ggggctggta ttcatgaaag gcaactgggc 1440 
ccattctggc tttcctgaaa ttgcatttgg ccgttattca gattccctgg tgcagaaggg 1500 
ctataaagta gcacgagtgg aacagactga gactccagaa atgatggagg cacgatgtag 1560 
aaagatggca catatatcca agtatgatag agtggtgagg agggagatct gtaggatcat 1620 
taccaagggt acacagactt acagtgtgct ggaaggtgat ccctctgaga actacagtaa 1680 
gtatcttctt agcctcaaag aaaaagagga agattcttct ggccatactc gtgcatatgg 1740 
tgtgtgcttt gttgatactt cactgggaaa gtttttcata ggtcagtttt cagatgatcg 1800 
ccattgttcg agatttagga ctctagtggc acactatccc ccagtacaag ttttatttga 1860 
aaaaggaaat ctctcaaagg aaactaaaac aattctaaag agttcattgt cctgttctct 1920 
tcaggaaggt ctgatacccg gctcccagtt ttgggatgca tccaaaactt tgagaactct 1980 
ccttgaggaa gaatatttta gggaaaagct aagtgatggc attggggtga tgttacccca 2040 
ggtgcttaaa ggtatgactt cagagtctga ttccattggg ttgacaccag gagagaaaag 2100 
tgaattggcc ctctctgctc taggtggttg tgtcttctac ctcaaaaaat gccttattga 2160 
tcaggagctt ttatcaatgg ctaattttga agaatatatt cccttggatt ctgacacagt 2220 
cagcactaca agatctggtg ctatcttcac caaagcctat caacgaatgg tgctagatgc 2280 
agtgacatta aacaacttgg agatttttct gaatggaaca aatggttcta ctgaaggaac 2340 
cctactagag agggttgata cttgccatac tccttttggt aagcggctcc taaagcaatg 2400 
gctttgtgcc ccactctgta accattatgc tattaatgat cgtctagatg ccatagaaga 2460 
cctcatggtt gtgcctgaca aaatctccga agttgtagag cttctaaaga agcttccaga 2520 
tcttgagagg ctactcagta aaattcataa tgttgggtct cccctgaaga gtcagaacca 2580 
cccagacagc agggctataa tgtatgaaga aactacatac agcaagaaga agattattga 2640 
ttttctttct gctctggaag gattcaaagt aatgtgtaaa attataggga tcatggaaga 2700 
agttgctgat ggttttaagt ctaaaatcct taagcaggtc atctctctgc agacaaaaaa 2760 
tcctgaaggt cgttttcctg atttgactgt agaattgaac cgatgggata cagcctttga 2820 
ccatgaaaag gctcgaaaga ctggacttat tactcccaaa gcaggctttg actctgatta 2880 
tgaccaagct cttgctgaca taagagaaaa tgaacagagc ctcctggaat acctagagaa 2940 
acagcgcaac agaattggct gtaggaccat agtctattgg gggattggta ggaaccgtta 3000 
ccagctggaa attcctgaga atttcaccac tcgcaatttg ccagaagaat acgagttgaa 3060 
atctaccaag aagggctgta aacgatactg gaccaaaact attgaaaaga agttggctaa 3120 
tctcataaat gctgaagaac ggagggatgt atcattgaag gactgcatgc ggcgactgtt 3180 
ctataacttt gataaaaatt acaaggactg gcagtctgct gtagagtgta tcgcagtgtt 3240 
ggatgtttta ctgtgcctgg ctaactatag tcgagggggt gatggtccta tgtgtcgccc 3300 
agtaattctg ttgccggaag ataccccccc cttcttagag cttaaaggat cacgccatcc 3360 
ttgcattacg aagacttttt ttggagatga ttttattcct aatgacattc taataggctg 3420 
tgaggaagag gagcaggaaa atggcaaagc ctattgtgtg cttgttactg gaccaaatat 3480 
ggggggcaag tctacgctta tgagacaggc tggcttatta gctgtaatgg cccagatggg 3540 
ttgttacgtc cctgctgaag tgtgcaggct cacaccaatt gatagagtgt ttactagact 3600 
tggtgcctca gacagaataa tgtcaggtga aagtacattt tttgttgaat taagtgaaac 3660 
tgccagcata ctcatgcatg caacagcaca ttctctggtg cttgtggatg aattaggaag 3720 
aggtactgca acatttgatg ggacggcaat agcaaatgca gttgttaaag aacttgctga 3780 
gactataaaa tgtcgtacat tattttcaac tcactaccat tcattagtag aagattattc 3840 
tcaaaatgtt gctgtgcgcc taggacatat ggcatgcatg gtagaaaatg aatgtgaaga 3900 
ccccagccag gagactatta cgttcctcta taaattcatt aagggagctt gtcctaaaag 3960 
ctatggcttt aatgcagcaa ggcttgctaa tctcccagag gaagttattc aaaagggaca 4020 
tagaaaagca agagaatttg agaagatgaa tcagtcacta cgattatttc gggaagtttg 4080 
cctggctagt gaaaggtcaa ctgtagatgc tgaagctgtc cataaattgc tgactttgat 4140 
taaggaatta tagactgact acattggaag ctttgagttg acttctgaca aaggtggtaa 4200 
attcagacaa cattatgatc taataaactt tattttttaa aaat 4244 



MSH3 (human) (SEQ ID NO:28) 

MSRRKPASGG LAASSSAPAR QAVLSRFFQS TGSLKSTSSS TGAADQVDPG AAAAAAPPAP 60 
AFPPQLPPHV ATEIDRRKKR PLENDGPVKK KVKKVQQKEG GSDLGMSGNS EPKKCLRTRN 120 
VSKSLEKLKE FCCDSALPQS RVQTESLQER FAVLPKCTDF DDISLLHAKN AVSSEDSKRQ 180 
INQKDTTLFD LSQFGSSNTS HENLQKTASK SANKRSKSIY TPLELQYIEM KQQHKDAVLC 240 
VECGYKYRFF GEDAEIAARE LNIYCHLDHN FMTASIPTHR LFVHVRRLVA KGYKVGVVKQ 300 
TETAALKAIG DNRSSLFSRK LTALYTKSTL IGEDVNPLIK LDDAVNVDEI MTDTSTSYLL 360 
CISENKENVR DKKKGNIFIG IVGVQPATGE VVFDSFQDSA SRSELETRMS SLQPVELLLP 420 
SALSEQTEAL IHRATSVSVQ DDRIRVERMD NIYFEYSHAF QAVTEFYAKD TVDIKGSQII 480 
SGIVNLEKPV ICSLAAIIKY LKEFNLEKML SKPENFKQLS SKMEFMTING TTLRNLEILQ 540 
NQTDMKTKGS LLWVLDHTKT SFGRRKLKKW VTQPLLKLRE INARLDAVSE VLHSESSVFG 600 
QIENHLRKLP DIERGLCSIY HKKCSTQEFF LIVKTLYHLK SEFQAIIPAV NSHIQSDLLR 660 
TVTLEIPELL SPVEHYLKIL NEQAAKVGDK TELFKDLSDF PLIKKRKDEI QGVIDEIRMH 720 
LQEIRKILKN PSAQYVTVSG QEFMIEIKNS AVSCIPTDWV KVGSTKAVSR FHSPFIVENY 780 
RHLNQLREQL VLDCSAEWLD FLEKFSEHYH SLCKAVHHLA TVDCIFSLAK VAKQGDYCRP 840 
TVQEERKIVI KNGRHPVIDV LLGEQDQYVP NNTDLSEDSE RVMIITGPNM GGKSSYIKQV 900 
ALITIMAQIG SYVPAEEATI GIVDGIFTRM GAADNIYKGR STFMEELTDT AEIIRKATSQ 960 
SLVILDELGR GTSTHDGIAI AYATLEYFIR DVKSLTLFVT HYPPVCELEK NYSHQVGNYH 1020 
MGFLVSEDES KLDPGAAEQV PDFVTFLYQI TRGIAARSYG LNVAKLADVP GEILKKAAHK 1080 
SKELEGLINT KRKRLKYFAK LWTMHNAQDL QKWTEEFNME ETQTSLLH 1128 

MSH3 (human DNA) (SEQ ID NO:29) 

gggcacgagc cctgccatgt ctcgccggaa gcctgcgtcg ggcggcctcg ctgcctccag 60 
ctcagcccct gcgaggcaag cggttttgag ccgattcttc cagtctacgg gaagcctgaa 120 
atccacctcc tcctccacag gtgcagccga ccaggtggac cctggcgctg cagcggccgc 180 
agcgccccca gcgcccgcct tcccgcccca gctgccgccg cacgtagcta cagaaattga 240 
cagaagaaag aagagaccat tggaaaatga tgggcctgtt aaaaagaaag taaagaaagt 300 
ccaacaaaag gaaggaggaa gtgatctggg aatgtctggc aactctgagc caaagaaatg 360 
tctgaggacc aggaatgttt caaagtctct ggaaaaattg aaagaattct gctgcgattc 420 
tgcccttcct caaagtagag tccagacaga atctctgcag gagagatttg cagttctgcc 480 
aaaatgtact gattttgatg atatcagtct tctacacgca aagaatgcag tttcttctga 540 
agattcgaaa cgtcaaatta atcaaaagga cacaacactt tttgatctca gtcagtttgg 600 
atcatcaaat acaagtcatg aaaatttaca gaaaactgct tccaaatcag ctaacaaacg 660 
gtccaaaagc atctatacgc cgctagaatt acaatacata gaaatgaagc agcagcacaa 720 
agatgcagtt ttgtgtgtgg aatgtggata taagtataga ttctttgggg aagatgcaga 780 
gattgcagcc cgagagctca atatttattg ccatttagat cacaacttta tgacagcaag 840 
tatacctact cacagactgt ttgttcatgt acgccgcctg gtggcaaaag gatataaggt 900 
gggagttgtg aagcaaactg aaactgcagc attaaaggcc attggagaca acagaagttc 960 
actcttttcc cggaaattga ctgcccttta tacaaaatct acacttattg gagaagatgt 1020 
gaatccccta atcaagctgg atgatgctgt aaatgttgat gagataatga ctgatacttc 1080 
taccagctat cttctgtgca tctctgaaaa taaggaaaat gttagggaca aaaaaaaggg 1140 
caacattttt attggcattg tgggagtgca gcctgccaca ggcgaggttg tgtttgatag 1200 
tttccaggac tctgcttctc gttcagagct agaaacccgg atgtcaagcc tgcagccagt 1260 
agagctgctg cttccttcgg ccttgtccga gcaaacagag gcgctcatcc acagagccac 1320 
atctgttagt gtgcaggatg acagaattcg agtcgaaagg atggataaca tttattttga 1380 






atacagccat gctttccagg cagttacaga gttttatgca aaagatacag ttgacatcaa 1440
aggttctcaa attatttctg gcattgttaa cttagagaag cctgtgattt gctctttggc 1500
tgccatcata aaatacctca aagaattcaa cttggaaaag atgctctcca aacctgagaa 1560
ttttaaacag ctatcaagta aaatggaatt tatgacaatt aatggaacaa cattaaggaa 1620
tctggaaatc ctacagaatc agactgatat gaaaaccaaa ggaagtttgc tgtgggtttt 1680
agaccacact aaaacttcat ttgggagacg gaagttaaag aagtgggtga cccagccact 1740
ccttaaatta agggaaataa atgcccggct tgatgctgta tcggaagttc tccattcaga 1800
atctagtgtg tttggtcaga tagaaaatca tctacgtaaa ttgcccgaca tagagagggg 1860
actctgtagc atttatcaca aaaaatgttc tacccaagag ttcttcttga ttgtcaaaac 1920
tttatatcac ctaaagtcag aatttcaagc aataatacct gctgttaatt cccacattca 1980
gtcagacttg ctccggaccg ttattttaga aattcctgaa ctcctcagtc cagtggagca 2040
ttacttaaag atactcaatg aacaagctgc caaagttggg gataaaactg aattatttaa 2100
agacctttct gacttccctt taataaaaaa gaggaaggat gaaattcaag gtgttattga 2160
cgagatccga atgcatttgc aagaaatacg aaaaatacta aaaaatcctt ctgcacaata 2220
tgtgacagta tcaggacagg agtttatgat agaaataaag aactctgctg tatcttgtat 2280
accaactgat tgggtaaagg ttggaagcac aaaagctgtg agccgctttc actctccttt 2340
tattgtagaa aattacagac atctgaatca gctccgggag cagctagtcc ttgactgcag 2400
tgctgaatgg cttgattttc tagagaaatt cagtgaacat tatcactcct tgtgtaaagc 2460
agtgcatcac ctagcaactg ttgactgcat tttctccctg gccaaggtcg ctaagcaagg 2520
agattactgc agaccaactg tacaagaaga aagaaaaatt gtaataaaaa atggaaggca 2580
ccctgtgatt gatgtgttgc tgggagaaca ggatcaatat gtcccaaata atacagattt 2640
atcagaggac tcagagagag taatgataat taccggacca aacatgggtg gaaagagctc 2700
ctacataaaa caagttgcat tgattaccat catggctcag attggctcct atgttcctgc 2760
agaagaagcg acaattggga ttgtggatgg cattttcaca aggatgggtg ctgcagacaa 2820
tatatataaa ggacggagta catttatgga agaactgact gacacagcag aaataatcag 2880
aaaagcaaca tcacagtcct tggttatctt ggatgaacta ggaagaggga cgagcactca 2940
tgatggaatt gccattgcct atgctacact tgagtatttc atcagagatg tgaaatcctt 3000
aaccctgttt gtcacccatt atccgccagt ttgtgaacta gaaaaaaatt actcacacca 3060
ggtggggaat taccacatgg gattcttggt cagtgaggat gaaagcaaac tggatccagg 3120
cgcagcagaa caagtccctg attttgtcac cttcctttac caaataacta gaggaattgc 3180
agcaaggagt tatggattaa atgtggctaa actagcagat gttcctggag aaattttgaa 3240
gaaagcagct cacaagtcaa aagagctgga aggattaata aatacgaaaa gaaagagact 3300
caagtatttt gcaaagttat ggacgatgca taatgcacaa gacctgcaga agtggacaga 3360
ggagttcaac atggaagaaa cacagacttc tcttcttcat taaaatgaag actacatttg 3420
tgaacaaaaa atggagaatt aaaaatacca actgtacaaa ataactctcc agtaacagcc 3480
tatctttgtg tgacatgtga gcataaaatt atgaccatgg tatattccta ttggaaacag 3540
agaggttttt ctgaagacag tctttttcaa gtttctgtct tcctaacttt tctacgtata 3600
aacactcttg aatagacttc cactttgtaa ttagaaaatt ttatggacag taagtccagt 3660
aaagccttaa gtggcagaat ataattccca agcttttgga gggtgatata aaaatttact 3720
tgatattttt atttgtttca gttcagataa ttggcaactg ggtgaatctg gcaggaatct 3780
atccattgaa ctaaaataat tttattatgc aaccagttta tccaccaaga acataagaat 3840
tttttataag tagaaagaat tggccaggca tggtggctca tgcctgtaat cccagcactt 3900
tgggaggcca aggtaggcag atcacctgag gtcaggagtt caagaccagc ctggccaaca 3960
tggcaaaacc ccatctttac taaaaatata aagtacatct ctactaaaaa tacgaaaaaa 4020
ttagctgggc atggtggcgc acacctgtag tcccagctac tccggaggct gaggcaggag 4080
aatctcttga acctgggagg cggaggttgc aatgagccga gatcacgtca ctgcactcca 4140
gcttgggcaa cagagcaaga ctccatctca aaaaagaaaa aagaaaagaa atagaattat 4200
caagctttta aaaactagag cacagaagga ataaggtcat gaaatttaaa aggttaaata 4260
ttgtcatagg attaagcagt ttaaagattg ttggatgaaa ttatttgtca ttcattcaag 4320
taataaatat ttaatgaata cttgctataa aaaaaaaaaa aaaaaaaaaa aaaa 4374



Each reference cited herein is hereby incorporated by reference in its entirety. 
We claim:

1. A method for making a hypermutable cell comprising exposing a cell to an inhibitor
of mismatch repair, wherein said inhibitor is an anthracene, an ATPase inhibitor, a nuclease 
inhibitor, a polymerase inhibitor, or an antisense oligonucleotide that specifically hybridizes 
to a nucleotide encoding a mismatch repair protein. 

2. The method of claim 1 wherein said inhibitor is an anthracene. 

3. The method of claim 2 wherein said anthracene has the formula: 




wherein R1-R10 are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl,
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S-alkenyl,
N-alkenyl, O-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy,
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol,
an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an aldehyde group, an ester, an ether, a
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid,
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl
groups;

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and 

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted alkynyl,
substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower alkyl, aryl, heteroaryl,
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino;

and wherein said amino groups are optionally substituted with an acyl group, or 1 to 3 aryl
or lower alkyl groups.

4. The method of claim 3 wherein R5 and R6 are hydrogen.

5. The method of claim 3 wherein R1-R10 are independently hydrogen, hydroxyl, alkyl,
aryl, arylalkyl, or hydroxyalkyl.

6. The method of claim 3 wherein R1-R10 are independently hydrogen, hydroxyl, methyl,
ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, tolyl, hydroxymethyl, hydroxypropyl, or 
hydroxybutyl. 

7. The method of claim 3 wherein said anthracene is selected from the group consisting
of 1,2-dimethylanthracene, 9,10-dimethylanthracene, 7,8-dimethylanthracene,
9,10-diphenylanthracene, 9,10-dihydroxymethylanthracene,
9-hydroxymethyl-10-methylanthracene, dimethylanthracene-1,2-diol,
9-hydroxymethyl-10-methylanthracene-1,2-diol,
9-hydroxymethyl-10-methylanthracene-3,4-diol, and 9,10-di-m-tolylanthracene.

8. The method of claim 3 wherein R3, R4, R5, R6, R7, R8, R9 and R10 are hydrogen.

9. The method of claim 3 wherein R1, R2, R3, R4, R5, R6, R7 and R8 are hydrogen.

10. The method of claim 3 wherein R 1 , R^, R 3 , R* Rs> R 7 and R« are hydrogen. 

11. The method of claim 3 wherein R„ R^, R 3 , R^ R5, R*> R* and Rio are hydrogen. 

12. The method of claim 3 wherein R„ R 3 , R«, Rj, Re, R 7 and Rg are hydrogen. 

13. The method of claim 3 wherein R„ R* R 3 , R,, Rj, R«, R 7 , R« a* d R, 0 are hydrogen. 

14. The method of claim 1 wherein said ATPase inhibitor is a nonhydrolyzable form of
ATP such as AMP-PNP.

15. The method of claim 1 wherein said nuclease inhibitor is an analog of
N-ethylmaleimide, a heterodimeric adenine-chain-acridine compound, or a quinolone such
as heliquinomycin.

16. The method of claim 1 wherein said polymerase inhibitor is an analog of
aphidicolin, 1-(2'-deoxy-2'-fluoro-beta-L-arabinofuranosyl)-5-methyluracil (L-FMAU) or
2',3'-dideoxyribonucleoside 5'-triphosphates.

17. The method of claim 1 wherein said antisense oligonucleotide comprises about 15 
consecutive nucleotides that are complementary to the coding strand of a mismatch repair 
protein, wherein said antisense oligonucleotide specifically binds to said coding strand of 
said mismatch repair protein under physiological conditions and inhibits mismatch repair 
activity of said mismatch repair protein. 

18. The method of claim 17 wherein said antisense oligonucleotide specifically binds
to a regulatory portion on said coding strand of said mismatch repair protein. 

19. The method of claim 17 wherein said antisense oligonucleotide is directed against 
the first six codons of a MMR gene message. 

20. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a eukaryotic cell in vitro. 



21. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a prokaryotic cell in vitro. 

22. The method of claim 1 wherein said inhibitor of mismatch repair is introduced into 
a growth medium of a plant.

23. A method for generating a mutation in a gene of interest comprising exposing a cell 
comprising said gene of interest to a chemical mismatch repair inhibitor and testing said 
cell to determine whether said gene of interest comprises a mutation. 

24. The method of claim 23 wherein said testing comprises analyzing a polynucleotide
sequence of said gene of interest. 

25. The method of claim 23 wherein said testing comprises analyzing a protein 
encoded by said gene of interest.

26. The method of claim 23 wherein said testing comprises analyzing the phenotype of 
said cell. 

27. The method of claim 23 wherein said cell is made mismatch repair defective by
exposing said mammalian cell to an inhibitor of mismatch repair.

28. The method of claim 27 further comprising removing said inhibitor of mismatch
repair after determining that said gene of interest comprises a mutation.

29. The method of claim 27 wherein said testing comprises analyzing a polynucleotide
sequence of said gene of interest. 

30. The method of claim 27 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 



31. The method of claim 27 wherein said testing comprises analyzing the phenotype of
said cell.



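32. A method for generating a mutation in a gene of interest comprising exposing an
animal comprising said gene of interest to a chemical inhibitor of mismatch repair and
testing said animal to determine whether the gene of interest comprises a mutation.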



33. The method of claim 32 wherein said animal is a mammal.

34. The method of claim 32 wherein said testing comprises analyzing a polynucleotide
sequence of said gene of interest. 

35. The method of claim 32 wherein said testing comprises analyzing a protein 
encoded by said gene of interest.

36. The method of claim 32 wherein said testing comprises analyzing the phenotype of 
said cell. 

37. The method of claim 33 wherein said mammal is made mismatch repair defective 
by exposing said mammal to an inhibitor of mismatch repair. 

38. The method of claim 37 further comprising removing said inhibitor of mismatch
repair after determining that said gene of interest comprises a mutation.

39. A hypermutable transgenic mammal made by the method of claim 33. 

40. A method for generating a mismatch repair defective plant comprising exposing 
said plant to an inhibitor of mismatch repair. 

41. A method for generating a mutation in a gene of interest comprising growing a 
plant comprising said gene of interest, exposing said plant to an inhibitor of mismatch 
repair, and testing said plant to determine whether said gene of interest comprises a 
mutation. 

42. The method of claim 41 wherein said testing comprises analyzing a polynucleotide 
sequence of said gene of interest. 

43. The method of claim 41 wherein said testing comprises analyzing a protein 
encoded by said gene of interest. 



44. The method of claim 41 wherein said testing comprises analyzing the phenotype of
said plant.

45. V »^«^*^*J^*^'*^^'^* 
exposing said plant to an inhibitor of mismatch repair. 

46. A hypermutable plant made by the method of claim 40.

47. The plant of claim 46 wherein said plant is a monocot.

48. The plant of claim 46 wherein said plant is a dicot.
49 Ame modforscreenmgfor^^ 

for microsatellite instability. 

50. The method of claim 49 wherein said organism is a mammal. 

51. The method of claim 49 wherein said organism is a microbe. 

52. The method of claim 49 wherein said organism is a plant.

53. The method of claim 49 wherein said screening comprises monitoring endogenous
microsatellites. 

within a coding region of said reporter gene. 

55. The method of claim 54 wherein said reporter gene is β-glucuronidase.

56. Amemodforblocldngmis^^ 
to an anthracene compound. 

57. The method of claim 56 wherein said anthracene comprises the formula:




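wherein R1-R10 are independently hydrogen, hydroxyl, amino, alkyl, substituted alkyl, alkenyl,
substituted alkenyl, alkynyl, substituted alkynyl, O-alkyl, S-alkyl, N-alkyl, O-alkenyl, S-alkenyl,
N-alkenyl, O-alkynyl, S-alkynyl, N-alkynyl, aryl, substituted aryl, aryloxy, substituted
aryloxy, heteroaryl, substituted heteroaryl, aralkyloxy, arylalkyl, alkylaryl, alkylaryloxy,
arylsulfonyl, alkylsulfonyl, alkoxycarbonyl, aryloxycarbonyl, guanidino, carboxy, an alcohol,
an amino acid, sulfonate, alkyl sulfonate, CN, NO2, an aldehyde group, an ester, an ether, a
crown ether, a ketone, an organosulfur compound, an organometallic group, a carboxylic acid,
an organosilicon or a carbohydrate that optionally contains one or more alkylated hydroxyl
groups;

wherein said heteroalkyl, heteroaryl, and substituted heteroaryl contain at least one
heteroatom that is oxygen, sulfur, a metal atom, phosphorus, silicon or nitrogen; and

wherein said substituents of said substituted alkyl, substituted alkenyl, substituted alkynyl,
substituted aryl, and substituted heteroaryl are halogen, CN, NO2, lower alkyl, aryl, heteroaryl,
aralkyl, aralkyloxy, guanidino, alkoxycarbonyl, alkoxy, hydroxy, carboxy and amino;

and wherein said amino groups are optionally substituted with an acyl group, or 1 to 3 aryl
or lower alkyl groups.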

58. The method of claim 57 wherein R5 and R6 are hydrogen.

59. The method of claim 57 wherein R1-R10 are independently hydrogen, hydroxyl, alkyl,
aryl, arylalkyl, or hydroxyalkyl.

60. The method of claim 57 wherein R1-R10 are independently hydrogen, hydroxyl, methyl,
ethyl, propyl, isopropyl, butyl, isobutyl, phenyl, tolyl, hydroxymethyl, hydroxypropyl, or
hydroxybutyl.

61. The method of claim 57 wherein said anthracene is selected from the group consisting
of 1,2-dimethylanthracene, 9,10-dimethylanthracene, 7,8-dimethylanthracene,
9,10-diphenylanthracene, 9,10-dihydroxymethylanthracene,
9-hydroxymethyl-10-methylanthracene, dimethylanthracene-1,2-diol,
9-hydroxymethyl-10-methylanthracene-1,2-diol,
9-hydroxymethyl-10-methylanthracene-3,4-diol, and 9,10-di-m-tolylanthracene.

62. The method of claim 57 wherein R3, R4, R5, R6, R7, R8, R9 and R10 are hydrogen.

63. The method of claim 57 whe^^ 

64. Themethod of claim 57 wherein R, R» R 3 , R» Rs, R. ^ ****** hydrogen. 

65. The method of claim 57 wherein R„ R, R 3 , R* R* R* R> and R l0 are hydrogen. 

66 . The method of claim 57 wherein R„ R, R 3 , . *. R* and R, are hydrogen. 

67. The method of claim 57 wherein R„ R,, R 3 , R* R» *. ^ ** ^ R » 315 , V dn * aL 

68. The method of claim 23 further comprising exposing said cell to a mutagen.

69. The method of claim 32 further comprising exposing said animal to a mutagen. 

70. The method of claim 68 or 69 wherein said mutagen is selected from the group
consisting of N-methyl-N'-nitro-N-nitrosoguanidine, methane sulfonate, dimethyl
sulfonate, O-6-methyl benzadine, ethyl methanesulfonate, methylnitrosourea, and
ethylnitrosourea.

71. The method of claim 49 wherein the chemical is a MMR inhibitor wherein it
induces microsatellite instability in MMR proficient cells but does not induce enhanced
microsatellite instability in MMR deficient cells.
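
By way of non-limiting illustration only, the antisense oligonucleotide recited in claims 17 and 19 can be pictured as the reverse complement of the start of a coding strand. The sketch below is an assumption-laden example and not the applicants' procedure: the function name `antisense_oligo` is hypothetical, and the example input is the first thirty nucleotides of the open reading frame as it appears in SEQ ID NO:17, assuming the reading frame begins at the first `atg`.

```python
# Minimal sketch: derive an antisense oligonucleotide directed against the
# first N codons of a coding-strand sequence (reverse complement, written 5'->3').
# Assumption: the input begins at the start codon of the open reading frame.

_COMPLEMENT = {"a": "t", "t": "a", "g": "c", "c": "g"}


def antisense_oligo(coding_strand: str, n_codons: int = 6) -> str:
    """Return the reverse complement of the first n_codons codons."""
    # Sequence listings group bases in blocks of ten; drop that whitespace.
    seq = "".join(coding_strand.lower().split())
    target = seq[: 3 * n_codons]
    if len(target) < 3 * n_codons:
        raise ValueError("sequence is shorter than the requested codons")
    if any(base not in _COMPLEMENT for base in target):
        raise ValueError("sequence contains a character other than a, c, g, t")
    return "".join(_COMPLEMENT[base] for base in reversed(target))


if __name__ == "__main__":
    # First 30 nt of the reading frame as listed in SEQ ID NO:17 (assumed start).
    orf_start = "atggagcgag ctgagagctc gagtacagaa"
    print(antisense_oligo(orf_start))  # 18-mer against the first six codons
```

An oligonucleotide of about 15 consecutive nucleotides, as in claim 17, would correspond to calling the same hypothetical function with `n_codons=5`.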

<110> Nicolaides, Nicholas C
Grasso, Luigi 
Sass, Philip M 

<120> CHEMICAL INHIBITORS OF MISMATCH REPAIR 
<130> MOR-0018 

<140> 00/000,000 
<141> 2001-01-15 

<160> 44 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 52 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: oligonucleotide
primer 

ggatcctaat acgactcact atagggagac caccatgtcg ttcgtggcag gg 52 



<210> 2 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 2
taagtcttaa gtgctaccaa c 21



<210> 3 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220>
<223> Description of Artificial Sequence: oligonucleotide
primer

r/ccLt acactcact at. OT o,,.c cacc,t 99 aa cat^ct, « 



<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 4
aggttagtga agactctgtc 20

<210> 5 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 5
ctgatctcac ggacaatagt gc 22



<210> 6 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 6
ggctccataa aaagtgcacc 20



<210> 7 
<211> 22
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 7 22 
ggtctgttga tgtcgtaagt cg



<210> 8 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: oligonucleotide
primer 

<400> 8 22 
atcttgaaac ctttagggag gg 



<210> 9 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<223> Description of Artificial Sequence: oligonucleotide
primer 

<400> 9 18 
agaagtttag acaggtac 



<210> 10 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220>
<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 10
aaatgtgcaa ttgccttc 18



<210> 11 
<211> 51 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

ctcggatcca tgttaaaaaa aaaaaaaaaa aaaaaacgtc ctgtagaaac c 51



<210> 12 
<211> 48 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: oligonucleotide 
primer 

ctcggatcca tgttaaaaaa aaaaaaaaaa aaaacgtcct gtagaaac 48



<210> 13 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 13 30 
cccgaattcc ccgatctagt aacatagatg 



<210> 14 

<211> 859 

<212> PRT 

<213> Mus musculus 

<400> 14 
Met Glu Gln Thr Glu Gly Val Ser Thr Glu Cys Ala Lys Ala Ile Lys
1 5 10 15

Pro Ile Asp Gly Lys Ser Val His Gln Ile Cys Ser Gly Gln Val Ile
20 25 30

Leu Ser Leu Ser Thr Ala Val Lys Glu Leu Ile Glu Asn Ser Val Asp
35 40 45

Ala Gly Ala Thr Thr Ile Asp Leu Arg Leu Lys Asp Tyr Gly Val Asp
50 55 60

Leu Ile Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe
65 70 75 80

Glu Gly Leu Ala Leu Lys His His Thr Ser Lys Ile Gln Glu Phe Ala
85 90 95

Asp Leu Thr Gln Val Glu Thr Phe Gly Phe Arg Gly Glu Ala Leu Ser
100 105 110

Ser Leu Cys Ala Leu Ser Asp Val Thr Ile Ser Thr Cys His Gly Ser
115 120 125

Ala Ser Val Gly Thr Arg Leu Val Phe Asp His Asn Gly Lys Ile Thr
130 135 140

Gln Lys Thr Pro Tyr Pro Arg Pro Lys Gly Thr Thr Val Ser Val Gln
145 150 155 160

His Leu Phe Tyr Thr Leu Pro Val Arg Tyr Lys Glu Phe Gln Arg Asn
165 170 175

Ile Lys Lys Glu Tyr Ser Lys Met Val Gln Val Leu Gln Ala Tyr Cys
180 185 190

Ile Ile Ser Ala Gly Val Arg Val Ser Cys Thr Asn Gln Leu Gly Gln
195 200 205

Gly Lys Arg His Ala Val Val Cys Thr Ser Gly Thr Ser Gly Met Lys
210 215 220

Glu Asn Ile Gly Ser Val Phe Gly Gln Lys Gln Leu Gln Ser Leu Ile
225 230 235 240

Pro Phe Val Gln Leu Pro Pro Ser Asp Ala Val Cys Glu Glu Tyr Gly
245 250 255

Leu Ser Thr Ser Gly Arg His Lys Thr Phe Ser Thr Phe Arg Ala Ser
260 265 270

Phe His Ser Ala Arg Thr Ala Pro Gly Gly Val Gln Gln Thr Gly Ser
275 280 285

Phe Ser Ser Ser Ile Arg Gly Pro Val Thr Gln Gln Arg Ser Leu Ser
290 295 300

Leu Ser Met Arg Phe Tyr His Met Tyr Asn Arg His Gln Tyr Pro Phe
305 310 315 320

Val Val Leu Asn Val Ser Val Asp Ser Glu Cys Val Asp Ile Asn Val
325 330 335

Thr Pro Asp Lys Arg Gln Ile Leu Leu Gln Glu Glu Lys Leu Leu Leu
340 345 350

Ala Val Leu Lys Thr Ser Leu Ile Gly Met Phe Asp Ser Asp Ala Asn
355 360 365

Lys Leu Asn Val Asn Gln Gln Pro Leu Leu Asp Val Glu Gly Asn Leu
370 375 380

Val Lys Leu His Thr Ala Glu Leu Glu Lys Pro Val Pro Gly Lys Gln
385 390 395 400

Asp Asn Ser Pro Ser Leu Lys Ser Thr Ala Asp Glu Lys Arg Val Ala
405 410 415

Ser Ile Ser Arg Leu Arg Glu Ala Phe Ser Leu His Pro Thr Lys Glu
420 425 430

Ile Lys Ser Arg Gly Pro Glu Thr Ala Glu Leu Thr Arg Ser Phe Pro
435 440 445

Ser Glu Lys Arg Gly Val Leu Ser Ser Tyr Pro Ser Asp Val Ile Ser
450 455 460

Tyr Arg Gly Leu Arg Gly Ser Gln Asp Lys Leu Val Ser Pro Thr Asp
465 470 475 480

Ser Pro Gly Asp Cys Met Asp Arg Glu Lys Ile Glu Lys Asp Ser Gly
485 490 495

Leu Ser Ser Thr Ser Ala Gly Ser Glu Glu Glu Phe Ser Thr Pro Glu
500 505 510

Val Ala Ser Ser Phe Ser Ser Asp Tyr Asn Val Ser Ser Leu Glu Asp
515 520 525

Arg Pro Ser Gln Glu Thr Ile Asn Cys Gly Asp Leu Asp Cys Arg Pro
530 535 540

Pro Gly Thr Gly Gln Ser Leu Lys Pro Glu Asp His Gly Tyr Gln Cys
545 550 555 560

Lys Ala Leu Pro Leu Ala Arg Leu Ser Pro Thr Asn Ala Lys Arg Phe
565 570 575

Lys Thr Glu Glu Arg Pro Ser Asn Val Asn Ile Ser Gln Arg Leu Pro
580 585 590

Gly Pro Gln Ser Thr Ser Ala Ala Glu Val Asp Val Ala Ile Lys Met
595 600 605

Asn Lys Arg Ile Val Leu Leu Glu Phe Ser Leu Ser Ser Leu Ala Lys
610 615 620

Arg Met Lys Gln Leu Gln His Leu Lys Ala Gln Asn Lys His Glu Leu
625 630 635 640

Ser Tyr Arg Lys Phe Arg Ala Lys Ile Cys Pro Gly Glu Asn Gln Ala
645 650 655

Ala Glu Asp Glu Leu Arg Lys Glu Ile Ser Lys Ser Met Phe Ala Glu
660 665 670

Met Glu Ile Leu Gly Gln Phe Asn Leu Gly Phe Ile Val Thr Lys Leu
675 680 685

Lys Glu Asp Leu Phe Leu Val Asp Gln His Ala Ala Asp Glu Lys Tyr
690 695 700

Asn Phe Glu Met Leu Gln Gln His Thr Val Leu Gln Ala Gln Arg Leu
705 710 715 720

Ile Thr Pro Gln Thr Leu Asn Leu Thr Ala Val Asn Glu Ala Val Leu
725 730 735

Ile Glu Asn Leu Glu Ile Phe Arg Lys Asn Gly Phe Asp Phe Val Ile
740 745 750

Asp Glu Asp Ala Pro Val Thr Glu Arg Ala Lys Leu Ile Ser Leu Pro
755 760 765

Thr Ser Lys Asn Trp Thr Phe Gly Pro Gln Asp Ile Asp Glu Leu Ile
770 775 780

Phe Met Leu Ser Asp Ser Pro Gly Val Met Cys Arg Pro Ser Arg Val
785 790 795 800

Arg Gln Met Phe Ala Ser Arg Ala Cys Arg Lys Ser Val Met Ile Gly
805 810 815

Thr Ala Leu Asn Ala Ser Glu Met Lys Lys Leu Ile Thr His Met Gly
820 825 830

Glu Met Asp His Pro Trp Asn Cys Pro His Gly Arg Pro Thr Met Arg
835 840 845

His Val Ala Asn Leu Asp Val Ile Ser Gln Asn
850 855



<210> 15 
<211> 3056 
<212> DNA 

<213> Mus musculus 
<400> 15 

gaattccggt gaaggtcctg aagaatttcc agattcctga gtatcattgg aggagacaga 60
taacctgtcg tcaggtaacg atggtgtata tgcaacagaa atgggtgttc ctggagacgc 120
gtcttttccc gagagcggca ccgcaactct cccgcggtga ctgtgactgg aggagtcctg 180
catccatgga gcaaaccgaa ggcgtgagta cagaatgtgc taaggccatc aagcctattg 240
atgggaagtc agtccatcaa atttgttctg ggcaggtgat actcagttta agcaccgctg 300
tgaaggagtt gatagaaaat agtgtagatg ctggtgctac tactattgat ctaaggctta 360
aagactatgg ggtggacctc attgaagttt cagacaatgg atgtggggta gaagaagaaa 420
actttgaagg tctagctctg aaacatcaca catctaagat tcaagagttt gccgacctca 480
cgcaggttga aactttcggc tttcgggggg aagctctgag ctctctgtgt gcactaagtg 540
atgtcactat atctacctgc cacgggtctg caagcgttgg gactcgactg gtgtttgacc 600
ataatgggaa aatcacccag aaaactccct acccccgacc taaaggaacc acagtcagtg 660
tgcagcactt attttataca ctacccgtgc gttacaaaga gtttcagagg aacattaaaa 720
aggagtattc caaaatggtg caggtcttac aggcgtactg tatcatctca gcaggcgtcc 780
gtgtaagctg cactaatcag ctcggacagg ggaagcggca cgctgtggtg tgcacaagcg 840
gcacgtctgg catgaaggaa aatatcgggt ctgtgtttgg ccagaagcag ttgcaaagcc 900
tcattccttt tgttcagctg ccccctagtg acgctgtgtg tgaagagtac ggcctgagca 960
cttcaggacg ccacaaaacc ttttctacgt ttcgggcttc atttcacagt gcacgcacgg 1020
cgccgggagg agtgcaacag acaggcagtt tttcttcatc aatcagaggc cctgtgaccc 1080
agcaaaggtc tctaagcttg tcaatgaggt tttatcacat gtataaccgg catcagtacc 1140
catttgtcgt ccttaacgtt tccgttgact cagaatgtgt ggatattaat gtaactccag 1200
ataaaaggca aattctacta caagaagaga agctattgct ggccgtttta aagacctcct 1260
tgataggaat gtttgacagt gatgcaaaca agcttaatgt caaccagcag ccactgctag 1320
atgttgaagg taacttagta aagctgcata ctgcagaact agaaaagcct gtgccaggaa 1380
agcaagataa ctctccttca ctgaagagca cagcagacga gaaaagggta gcatccatct 1440
ccaggctgag agaggccttt tctcttcatc ctactaaaga gatcaagtct aggggtccag 1500
agactgctga actgacacgg agttttccaa gtgagaaaag gggcgtgtta tcctcttatc 1560
cttcagacgt catctcttac agaggcctcc gtggctcgca ggacaaattg gtgagtccca 1620
cggacagccc tggtgactgt atggacagag agaaaataga aaaagactca gggctcagca 1680
gcacctcagc tggctctgag gaagagttca gcaccccaga agtggccagt agctttagca 1740
gtgactataa cgtgagctcc ctagaagaca gaccttctca ggaaaccata aactgtggtg 1800
acctggactg ccgtcctcca ggtacaggac agtccttgaa gccagaagac catggatatc 1860
aatgcaaagc tctacctcta gctcgtctgt cacccacaaa tgccaagcgc ttcaagacag 1920
aggaaagacc ctcaaatgtc aacatttctc aaagattgcc tggtcctcag agcacctcag 1980
cagctgaggt cgatgtagcc ataaaaatga ataagagaat cgtgctcctc gagttctctc 2040
tgagttctct agctaagcga atgaagcagt tacagcacct aaaggcgcag aacaaacatg 2100
aactgagtta cagaaaattt agggccaaga tttgccctgg agaaaaccaa gcagcagaag 2160
atgaactcag aaaagagatt agtaaatcga tgtttgcaga gatggagatc ttgggtcagt 2220
ttaacctggg atttatagta accaaactga aagaggacct cttcctggtg gaccagcatg 2280
ctgcggatga gaagtacaac tttgagatgc tgcagcagca cacggtgctc caggcgcaga 2340
ggctcatcac accccagact ctgaacttaa ctgctgtcaa tgaagctgta ctgatagaaa 2400
atctggaaat attcagaaag aatggctttg actttgtcat tgatgaggat gctccagtca 2460
ctgaaagggc taaattgatt tccttaccaa ctagtaaaaa ctggaccttt ggaccccaag 2520
atatagatga actgatcttt atgttaagtg acagccctgg ggtcatgtgc cggccctcac 2580
gagtcagaca gatgtttgct tccagagcct gtcggaagtc agtgatgatt ggaacggcgc 2640
tcaatgcgag cgagatgaag aagctcatca cccacatggg tgagatggac cacccctgga 2700
actgccccca cggcaggcca accatgaggc acgttgccaa tctggatgtc atctctcaga 2760
actgacacac cccttgtagc atagagttta ttacagattg ttcggtttgc aaagagaagg 2820
ttttaagtaa tctgattatc gttgtacaaa aattagcatg ctgctttaat gtactggatc 2880
catttaaaag cagtgttaag gcaggcatga tggagtgttc ctctagctca gctacttggg 2940
tgatccggtg ggagctcatg tgagcccagg actttgagac cactccgagc cacattcatg 3000
agactcaatt caaggacaaa aaaaaaaaga tatttttgaa gccttttaaa aaaaaa 3056



<210> 16 

<211> 862 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Glu Arg Ala Glu Ser Ser Ser Thr Glu Pro Ala Lys Ala Ile Lys
1 5 10 15

Pro Ile Asp Arg Lys Ser Val His Gln Ile Cys Ser Gly Gln Val Val
20 25 30

Leu Ser Leu Ser Thr Ala Val Lys Glu Leu Val Glu Asn Ser Leu Asp
35 40 45

Ala Gly Ala Thr Asn Ile Asp Leu Lys Leu Lys Asp Tyr Gly Val Asp
50 55 60

Leu Ile Glu Val Ser Asp Asn Gly Cys Gly Val Glu Glu Glu Asn Phe
65 70 75 80

Glu Gly Leu Thr Leu Lys His His Thr Ser Lys Ile Gln Glu Phe Ala
85 90 95

Asp Leu Thr Gln Val Glu Thr Phe Gly Phe Arg Gly Glu Ala Leu Ser
100 105 110

Ser Leu Cys Ala Leu Ser Asp Val Thr Ile Ser Thr Cys His Ala Ser
115 120 125

Ala Lys Val Gly Thr Arg Leu Met Phe Asp His Asn Gly Lys Ile Ile
130 135 140

Gln Lys Thr Pro Tyr Pro Arg Pro Arg Gly Thr Thr Val Ser Val Gln
145 150 155 160

Gln Leu Phe Ser Thr Leu Pro Val Arg His Lys Glu Phe Gln Arg Asn
165 170 175

Ile Lys Lys Glu Tyr Ala Lys Met Val Gln Val Leu His Ala Tyr Cys
180 185 190

Ile Ile Ser Ala Gly Ile Arg Val Ser Cys Thr Asn Gln Leu Gly Gln
195 200 205

Gly Lys Arg Gln Pro Val Val Cys Thr Gly Gly Ser Pro Ser Ile Lys
210 215 220

Glu Asn Ile Gly Ser Val Phe Gly Gln Lys Gln Leu Gln Ser Leu Ile
225 230 235 240

Pro Phe Val Gln Leu Pro Pro Ser Asp Ser Val Cys Glu Glu Tyr Gly
245 250 255

Leu Ser Cys Ser Asp Ala Leu His Asn Leu Phe Tyr Ile Ser Gly Phe
260 265 270

Ile Ser Gln Cys Thr His Gly Val Gly Arg Ser Ser Thr Asp Arg Gln
275 280 285

Phe Phe Phe Ile Asn Arg Arg Pro Cys Asp Pro Ala Lys Val Cys Arg
290 295 300

Leu Val Asn Glu Val Tyr His Met Tyr Asn Arg His Gln Tyr Pro Phe
305 310 315 320

Val Val Leu Asn Ile Ser Val Asp Ser Glu Cys Val Asp Ile Asn Val
325 330 335

Thr Pro Asp Lys Arg Gln Ile Leu Leu Gln Glu Glu Lys Leu Leu Leu
340 345 350

Ala Val Leu Lys Thr Ser Leu Ile Gly Met Phe Asp Ser Asp Val Asn
355 360 365

Lys Leu Asn Val Ser Gln Gln Pro Leu Leu Asp Val Glu Gly Asn Leu
370 375 380

Ile Lys Met His Ala Ala Asp Leu Glu Lys Pro Met Val Glu Lys Gln
385 390 395 400

Asp Gln Ser Pro Ser Leu Arg Thr Gly Glu Glu Lys Lys Asp Val Ser
405 410 415

Ile Ser Arg Leu Arg Glu Ala Phe Ser Leu Arg His Thr Thr Glu Asn
420 425 430

Lys Pro His Ser Pro Lys Thr Pro Glu Pro Arg Arg Ser Pro Leu Gly
435 440 445

Gln Lys Arg Gly Met Leu Ser Ser Ser Thr Ser Gly Ala Ile Ser Asp
450 455 460

Lys Gly Val Leu Arg Pro Gln Lys Glu Ala Val Ser Ser Ser His Gly
465 470 475 480

Pro Ser Asp Pro Thr Asp Arg Ala Glu Val Glu Lys Asp Ser Gly His
485 490 495

Gly Ser Thr Ser Val Asp Ser Glu Gly Phe Ser Ile Pro Asp Thr Gly
500 505 510

Ser His Cys Ser Ser Glu Tyr Ala Ala Ser Ser Pro Gly Asp Arg Gly
515 520 525

Ser Gln Glu His Val Asp Ser Gln Glu Lys Ala Pro Glu Thr Asp Asp
530 535 540

Ser Phe Ser Asp Val Asp Cys His Ser Asn Gln Glu Asp Thr Gly Cys
545 550 555 560

Lys Phe Arg Val Leu Pro Gln Pro Thr Asn Leu Ala Thr Pro Asn Thr
565 570 575

Lys Arg Phe Lys Lys Glu Glu Ile Leu Ser Ser Ser Asp Ile Cys Gln
580 585 590

Lys Leu Val *sn Thr Gin -P « « - - "» £ *" ^ "* 
595 600 

Vel Ly. XX. - Lys Lys « Vel Pro Leu MP Ph. Sex Met S.r Ser 

610 615 

L eu Me Lys hr, U. L„ Gin Leu Hi. His Glu Me Gin Gin Ser Glu 

ff25 « 3 ° " 5 



G1 , Gl» Gin Asn Tyr W Lys Phe Ar, Me Ly. H. Cy. Pro Gly Gin 
645 650 

, », a! a Glu Asp Glu Leu Arg Lys Glu lie Ser Lys Thr Met 
Asn Gin Ala Ala Glu Asp «»xu » g7Q 

660 665 
Phe Me Gin Met Gin Xie XX. Giy Gin Pee Asn L.U Gly Phe XI. XI. 

675 680 
„ Lys L.U » Gin UP XI. Ph. XI. vel Asp Gin His M. Thr Asp 
690 695 

rin His Thr Val Leu Gin Gly 
Glu Lys Tyr Asn Phe Glu Met Leu Gin Gin His Thr ^ 

705 710 71 

fln Arg Leu lie Ala Pro Gin Thr Leu Asn Leu Thr Ala Val Asn Glu 
725 7JU 

t „ rii, He Phe Arg Lys Asn Gly Phe Asp 
Ala Val Leu lie Glu Asn Leu Glu lie Phe Arg y 

740 745 
Phe Val lie Asp Glu Asn Ala Pro Val Thr Glu Arg Ala Lys Leu He 
755 760 



Glu 



Ser Len Pro Thr S.r Ly, Asn Trp Thr Ph. Gly Pro Gin Asp Vel Asp 
770 775 

to XI. Ph. Hot L.u S.r ASP S.r Pro Gly Vel «.t eye Ar, Pro 
790 795 

ser Ar g Vel Lys Gin Met Ph. Me ser Ar, Me Cys Ar 9 lys Ser Vel 

805 810 
M .t XI. Gly Thr Me len hen Thr Ser Glu Met Lys lys UJJ XI. Thr 



820 825 
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His Met Gly Glu Met Asp His Pro Trp Asn Cys Pro His Gly Arg Pro 
835 840 845 

Thr Met Arg His He Ala Asn Leu Gly Val He Ser Gin Asn 
850 855 860 



<210> 17 
<211> 2771 
<212> DNA 

<213> Homo sapiens 
<400> 17 

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120 
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180 
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240 
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300 
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360 
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420 
actcgactga tgtttgatca caatgggaaa attatccaga aaacccccta cccccgcccc 480 
agagggacca cagtcagcgt gcagcagtta ttttccacac tacctgtgcg ccataaggaa 540 
tttcaaagga atattaagaa ggagtatgcc aaaatggtcc aggtcttaca tgcatactgt 600 
atcatttcag caggcatccg tgtaagttgc accaatcagc ttggacaagg aaaacgacag 660 
cctgtggtat gcacaggtgg aagccccagc ataaaggaaa atatcggctc tgtgtttggg 720 
cagaagcagt tgcaaagcct cattcctttt gttcagctgc cccctagtga ctccgtgtgt 780 
gaagagtacg gtttgagctg ttcggatgct ctgcataatc ttttttacat ctcaggtttc 840
atttcacaat gcacgcatgg agttggaagg agttcaacag acagacagtt tttctttatc 900 
aaccggcggc cttgtgaccc agcaaaggtc tgcagactcg tgaatgaggt ctaccacatg 960 
tataatcgac accagtatcc atttgttgtt cttaacattt ctgttgattc agaatgcgtt 1020 
gatatcaatg ttactccaga taaaaggcaa attttgctac aagaggaaaa gcttttgttg 1080 
gcagttttaa agacctcttt gataggaatg tttgatagtg atgtcaacaa gctaaatgtc 1140 
agtcagcagc cactgctgga tgttgaaggt aacttaataa aaatgcatgc agcggatttg 1200 
gaaaagccca tggtagaaaa gcaggatcaa tccccttcat taaggactgg agaagaaaaa 1260 
aaagacgtgt ccatttccag actgcgagag gccttttctc ttcgtcacac aacagagaac 1320 
aagcctcaca gcccaaagac tccagaacca agaaggagcc ctctaggaca gaaaaggggt 1380 
atgctgtctt ctagcacttc aggtgccatc tctgacaaag gcgtcctgag acctcagaaa 1440 
gaggcagtga gttccagtca cggacccagt gaccctacgg acagagcgga ggtggagaag 1500 
gactcggggc acggcagcac ttccgtggat tctgaggggt tcagcatccc agacacgggc 1560 
agtcactgca gcagcgagta tgcggccagc tccccagggg acaggggctc gcaggaacat 1620 
gtggactctc aggagaaagc gcctgaaact gacgactctt tttcagatgt ggactgccat 1680 
tcaaaccagg aagataccgg atgtaaattt cgagttttgc ctcagccaac taatctcgca 1740
accccaaaca caaagcgttt taaaaaagaa gaaattcttt ccagttctga catttgtcaa 1800 
aagttagtaa atactcagga catgtcagcc tctcaggttg atgtagctgt gaaaattaat 1860 
aagaaagttg tgcccctgga cttttctatg agttctttag ctaaacgaat aaagcagtta 1920 
catcatgaag cacagcaaag tgaaggggaa cagaattaca ggaagtttag ggcaaagatt 1980 
tgtcctggag aaaatcaagc agccgaagat gaactaagaa aagagataag taaaacgatg 2040 
tttgcagaaa tggaaatcat tggtcagttt aacctgggat ttataataac caaactgaat 2100
gaggata tct tcata^a cca^cc 2160
cagcagcaca ccgtgctcca ggggcagagg ctcatagcac ^cagac 2280 
gctgttaatg aagctgttct gatagaaaat ctggaaatat "agaaagaa tgg g 
?ttgttatcg atgaaaatgc tcca g tcact ^aagggcta aactgatt^ cttgc 
ag taaaaact ggaccttcgg accccaggac gtcgatgaac tg-tctt«t 
agccctgggg tcatgtgccg gcc = t aga tgtttgc^ J ^ ^ 
cggaagtcgg tgatgattgg ^ctgctctt aacacaagcg * * ^ catgagacac 258 0 

=s= ss= =: » ~: ss 
sss srs =. — - 

cttttcaaac c 



<210> 18
<211> 932
<212> PRT

<213> Homo sapiens
<400> 18

Met Lys Gln Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gln
1 5 10 15

Ile Ile Thr Ser Val Val Ser Val Val Lys Glu Leu Ile Glu Asn Ser
20 25 30

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly
35 40 45

Phe Asp Lys Ile Glu Val Arg Asp Asn Gly Glu Gly Ile Lys Ala Val
50 55 60

Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys Ile Asn Ser
65 70 75 80

His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala
85 90 95

Leu Gly Ser Ile Cys Cys Ile Ala Glu Val Leu Ile Thr Thr Arg Thr
100 105 110

Ala Ala Asp Asn Phe Ser Thr Gln Tyr Val Leu Asp Gly Ser Gly His
115 120 125

Ile Leu Ser Gln Lys Pro Ser His Leu Gly Gln Gly Thr Thr Val Thr
130 135 140

Ala Leu Arg Leu Phe Lys Asn Leu Pro Val Arg Lys Gln Phe Tyr Ser
145 150 155 160

Thr Ala Lys Lys Cys Lys Asp Glu Ile Lys Lys Ile Gln Asp Leu Leu
165 170 175

Met Ser Phe Gly Ile Leu Lys Pro Asp Leu Arg Ile Val Phe Val His
180 185 190

Asn Lys Ala Val Ile Trp Gln Lys Ser Arg Val Ser Asp His Lys Met
195 200 205

Ala Leu Met Ser Val Leu Gly Thr Ala Val Met Asn Asn Met Glu Ser
210 215 220

Phe Gln Tyr His Ser Glu Glu Ser Gln Ile Tyr Leu Ser Gly Phe Leu
225 230 235 240

Pro Lys Cys Asp Ala Asp His Ser Phe Thr Ser Leu Ser Thr Pro Glu
245 250 255

Arg Ser Phe Ile Phe Ile Asn Ser Arg Pro Val His Gln Lys Asp Ile
260 265 270

Leu Lys Leu Ile Arg His His Tyr Asn Leu Lys Cys Leu Lys Glu Ser
275 280 285

Thr Arg Leu Tyr Pro Val Phe Phe Leu Lys Ile Asp Val Pro Thr Ala
290 295 300

Asp Val Asp Val Asn Leu Thr Pro Asp Lys Ser Gln Val Leu Leu Gln
305 310 315 320

Asn Lys Glu Ser Val Leu Ile Ala Leu Glu Asn Leu Met Thr Thr Cys
325 330 335

Tyr Gly Pro Leu Pro Ser Thr Asn Ser Tyr Glu Asn Asn Lys Thr Asp
340 345 350

Val Ser Ala Ala Asp Ile Val Leu Ser Lys Thr Ala Glu Thr Asp Val
355 360 365

Leu Phe Asn Lys Val Glu Ser Ser Gly Lys Asn Tyr Ser Asn Val Asp
370 375 380

Thr Ser Val Ile Pro Phe Gln Asn Asp Met His Asn Asp Glu Ser Gly
385 390 395 400

Lys Asn Thr Asp Asp Cys Leu Asn His Gln Ile Ser Ile Gly Asp Phe



405 410 415

Gly Tyr Gly His Cys Ser Ser Glu Ile Ser Asn Ile Asp Lys Asn Thr
420 425 430

Lys Asn Ala Phe Gln Asp Ile Ser Met Ser Asn Val Ser Trp Glu Asn
435 440 445

Ser Gln Thr Glu Tyr Ser Lys Thr Cys Phe Ile Ser Ser Val Lys His
450 455 460

Thr Gln Ser Glu Asn Gly Asn Lys Asp His Ile Asp Glu Ser Gly Glu
465 470 475 480

Asn Glu Glu Glu Ala Gly Leu Glu Asn Ser Ser Glu Ile Ser Ala Asp
485 490 495

Glu Trp Ser Arg Gly Asn Ile Leu Lys Asn Ser Val Gly Glu Asn Ile
500 505 510

Glu Pro Val Lys Ile Leu Val Pro Glu Lys Ser Leu Pro Cys Lys Val
515 520 525

Ser Asn Asn Asn Tyr Pro Ile Pro Glu Gln Met Asn Leu Asn Glu Asp
530 535 540

Ser Cys Asn Lys Lys Ser Asn Val Ile Asp Asn Lys Ser Gly Lys Val
545 550 555 560

Thr Ala Tyr Asp Leu Leu Ser Asn Arg Val Ile Lys Lys Pro Met Ser
565 570 575

Ala Ser Ala Leu Phe Val Gln Asp His Arg Pro Gln Phe Leu Ile Glu
580 585 590

Asn Pro Lys Thr Ser Leu Glu Asp Ala Thr Leu Gln Ile Glu Glu Leu
595 600 605

Trp Lys Thr Leu Ser Glu Glu Glu Lys Leu Lys Tyr Glu Glu Lys Ala
610 615 620

Thr Lys Asp Leu Glu Arg Tyr Asn Ser Gln Met Lys Arg Ala Ile Glu
625 630 635 640

Gln Glu Ser Gln Met Ser Leu Lys Asp Gly Arg Lys Lys Ile Lys Pro
645 650 655



Thr Ser Ala Trp Asn Leu Ala Gln Lys His Lys Leu Lys Thr Ser Leu
660 665 670

Ser Asn Gln Pro Lys Leu Asp Glu Leu Leu Gln Ser Gln Ile Glu Lys
675 680 685

Arg Arg Ser Gln Asn Ile Lys Met Val Gln Ile Pro Phe Ser Met Lys
690 695 700

Asn Leu Lys Ile Asn Phe Lys Lys Gln Asn Lys Val Asp Leu Glu Glu
705 710 715 720

Lys Asp Glu Pro Cys Leu Ile His Asn Leu Arg Phe Pro Asp Ala Trp
725 730 735

Leu Met Thr Ser Lys Thr Glu Val Met Leu Leu Asn Pro Tyr Arg Val
740 745 750

Glu Glu Ala Leu Leu Phe Lys Arg Leu Leu Glu Asn His Lys Leu Pro
755 760 765

Ala Glu Pro Leu Glu Lys Pro Ile Met Leu Thr Glu Ser Leu Phe Asn
770 775 780

Gly Ser His Tyr Leu Asp Val Leu Tyr Lys Met Thr Ala Asp Asp Gln
785 790 795 800

Arg Tyr Ser Gly Ser Thr Tyr Leu Ser Asp Pro Arg Leu Thr Ala Asn
805 810 815

Gly Phe Lys Ile Lys Leu Ile Pro Gly Val Ser Ile Thr Glu Asn Tyr
820 825 830

Leu Glu Ile Glu Gly Met Ala Asn Cys Leu Pro Phe Tyr Gly Val Ala
835 840 845

Asp Leu Lys Glu Ile Leu Asn Ala Ile Leu Asn Arg Asn Ala Lys Glu
850 855 860

Val Tyr Glu Cys Arg Pro Arg Lys Val Ile Ser Tyr Leu Glu Gly Glu
865 870 875 880

Ala Val Arg Leu Ser Arg Gln Leu Pro Met Tyr Leu Ser Lys Glu Asp
885 890 895

Ile Gln Asp Ile Ile Tyr Arg Met Lys His Gln Phe Gly Asn Glu Ile
900 905 910

Lys Glu Cys Val His Gly Arg Pro Phe Phe His His Leu Thr Tyr Leu
915 920 925

Pro Glu Thr Thr
930



<210> 19 

<211> 3063 

<212> DNA 

<213> Homo sapiens 

<400> 19

ggcacgagtg gctgcttgcg gctagtggat ggtaattgcc tgcctcgcgc tagcagcaag 60
ctgctctgtt aaaagcgaaa atgaaacaat tgcctgcggc aacagttcga ctcctttcaa 120
gttctcagat catcacttcg gtggtcagtg ttgtaaaaga gcttattgaa aactccttgg 180
atgctggtgc cacaagcgta gatgttaaac tggagaacta tggatttgat aaaattgagg 240
tgcgagataa cggggagggt atcaaggctg ttgatgcacc tgtaatggca atgaagtact 300
acacctcaaa aataaatagt catgaagatc ttgaaaattt gacaacttac ggttttcgtg 360
gagaagcctt ggggtcaatt tgttgtatag ctgaggtttt aattacaaca agaacggctg 420
ctgataattt tagcacccag tatgttttag atggcagtgg ccacatactt tctcagaaac 480
cttcacatct tggtcaaggt acaactgtaa ctgctttaag attatttaag aatctacctg 540
taagaaagca gttttactca actgcaaaaa aatgtaaaga tgaaataaaa aagatccaag 600
atctcctcat gagctttggt atccttaaac ctgacttaag gattgtcttt gtacataaca 660
aggcagttat ttggcagaaa agcagagtat cagatcacaa gatggctctc atgtcagttc 720
tggggactgc tgttatgaac aatatggaat cctttcagta ccactctgaa gaatctcaga 780
tttatctcag tggatttctt ccaaagtgtg atgcagacca ctctttcact agtctttcaa 840
caccagaaag aagtttcatc ttcataaaca gtcgaccagt acatcaaaaa gatatcttaa 900
agttaatccg acatcattac aatctgaaat gcctaaagga atctactcgt ttgtatcctg 960
ttttctttct gaaaatcgat gttcctacag ctgatgttga tgtaaattta acaccagata 1020
aaagccaagt attattacaa aataaggaat ctgttttaat tgctcttgaa aatctgatga 1080
cgacttgtta tggaccatta cctagtacaa attcttatga aaataataaa acagatgttt 1140
ccgcagctga catcgttctt agtaaaacag cagaaacaga tgtgcttttt aataaagtgg 1200
aatcatctgg aaagaattat tcaaatgttg atacttcagt cattccattc caaaatgata 1260
tgcataatga tgaatctgga aaaaacactg atgattgttt aaatcaccag ataagtattg 1320
gtgactttgg ttatggtcat tgtagtagtg aaatttctaa cattgataaa aacactaaga 1380
atgcatttca ggacatttca atgagtaatg tatcatggga gaactctcag acggaatata 1440
gtaaaacttg ttttataagt tccgttaagc acacccagtc agaaaatggc aataaagacc 1500
atatagatga gagtggggaa aatgaggaag aagcaggtct tgaaaactct tcggaaattt 1560
ctgcagatga gtggagcagg ggaaatatac ttaaaaattc agtgggagag aatattgaac 1620
ctgtgaaaat tttagtgcct gaaaaaagtt taccatgtaa agtaagtaat aataattatc 1680
caatccctga acaaatgaat cttaatgaag attcatgtaa caaaaaatca aatgtaatag 1740
ataataaatc tggaaaagtt acagcttatg atttacttag caatcgagta atcaagaaac 1800
ccatgtcagc aagtgctctt tttgttcaag atcatcgtcc tcagtttctc atagaaaatc 1860
ctaagactag tttagaggat gcaacactac aaattgaaga actgtggaag acattgagtg 1920
aagaggaaaa actgaaatat gaagagaagg ctactaaaga cttggaacga tacaatagtc 1980
aaatgaagag agccattgaa caggagtcac aaatgtcact aaaagatggc agaaaaaaga 2040
taaaacccac cagcgcatgg aatttggccc agaagcacaa gttaaaaacc tcattatcta 2100
atcaaccaaa acttgatgaa ctccttcagt cccaaattga aaaaagaagg agtcaaaata 2160
. a f a aai-ttt aagaaacaaa 2ZZU
ttaaaatggt acagatcccc ttttctatga ^ aggtttcctg 2280 

acaaagttga cttagaagag £2£2S aaatccatat agagtagaag 2340 

atgcatggct aatgacatcc aaaacagagg tcctg cagag ccactggaaa 2400 

aagccctgct atttaaaaga cttcttgaga ^cat * cctg^g S 
agccaattat gttaacagag agtcttttta cctgtctgat cctcgtctta 2520 

aaatgacagc agatgaccaa agatacagtg ^caa^tta ^gt^ aattacttgg 2580 
cagcgaatgg tttcaagata aaattgatac ca gagtttc attactg^ ^ 
aaatagaagg aatggctaat tgtctcccat tctatggagt g g cgcaaagtga 2700 

ttaatgctat attaaacaga aatgcaaagg «£*£g. tacttatcaa 2760 

taagttattt agagggagaa gcagtgcgtc WW tggaaat gaaattaaag 2820 
aagaggacat ccaagacatt atctacagaa tgaagcacca £ ^ actacat gat 2880 
ag tgtgttca tggtcgccca ttttttcatc att acct ^ ^ ^ 

taaatatgtt taagaagatt agttaccatt J tgagga ttca 3000 

tctggtttta aattatcttt gtattatgtg tcacatggtt ataaac taat 3060 

ctgaottgtt tttatattga aaaaagttco acgtattgta gaaaa ^ 



aac 



<210> 20 

<211> 934 

<212> PRT 

<213> Homo sapiens 

<400> 20

Met Ala Val Gln Pro Lys Glu Thr Leu Gln Leu Glu Ser Ala Ala Glu
1 5 10 15

Val Gly Phe Val Arg Phe Phe Gln Gly Met Pro Glu Lys Pro Thr Thr
20 25 30

Thr Val Arg Leu Phe Asp Arg Gly Asp Phe Tyr Thr Ala His Gly Glu
35 40 45

Asp Ala Leu Leu Ala Ala Arg Glu Val Phe Lys Thr Gln Gly Val Ile
50 55 60

Lys Tyr Met Gly Pro Ala Gly Ala Lys Asn Leu Gln Ser Val Val Leu
65 70 75 80

Ser Lys Met Asn Phe Glu Ser Phe Val Lys Asp Leu Leu Leu Val Arg
85 90 95

Gln Tyr Arg Val Glu Val Tyr Lys Asn Arg Ala Gly Asn Lys Ala Ser
100 105 110

Lys Glu Asn Asp Trp Tyr Leu Ala Tyr Lys Ala Ser Pro Gly Asn Leu
115 120 125

Ser Gln Phe Glu Asp Ile Leu Phe Gly Asn Asn Asp Met Ser Ala Ser
130 135 140

Ile Gly Val Val Gly Val Lys Met Ser Ala Val Asp Gly Gln Arg Gln
145 150 155 160

Val Gly Val Gly Tyr Val Asp Ser Ile Gln Arg Lys Leu Gly Leu Cys
165 170 175

Glu Phe Pro Asp Asn Asp Gln Phe Ser Asn Leu Glu Ala Leu Leu Ile
180 185 190

Gln Ile Gly Pro Lys Glu Cys Val Leu Pro Gly Gly Glu Thr Ala Gly
195 200 205

Asp Met Gly Lys Leu Arg Gln Ile Ile Gln Arg Gly Gly Ile Leu Ile
210 215 220

Thr Glu Arg Lys Lys Ala Asp Phe Ser Thr Lys Asp Ile Tyr Gln Asp
225 230 235 240

Leu Asn Arg Leu Leu Lys Gly Lys Lys Gly Glu Gln Met Asn Ser Ala
245 250 255

Val Leu Pro Glu Met Glu Asn Gln Val Ala Val Ser Ser Leu Ser Ala
260 265 270

Val Ile Lys Phe Leu Glu Leu Leu Ser Asp Asp Ser Asn Phe Gly Gln
275 280 285

Phe Glu Leu Thr Thr Phe Asp Phe Ser Gln Tyr Met Lys Leu Asp Ile
290 295 300

Ala Ala Val Arg Ala Leu Asn Leu Phe Gln Gly Ser Val Glu Asp Thr
305 310 315 320

Thr Gly Ser Gln Ser Leu Ala Ala Leu Leu Asn Lys Cys Lys Thr Pro
325 330 335

Gln Gly Gln Arg Leu Val Asn Gln Trp Ile Lys Gln Pro Leu Met Asp
340 345 350

Lys Asn Arg Ile Glu Glu Arg Leu Asn Leu Val Glu Ala Phe Val Glu
355 360 365

Asp Ala Glu Leu Arg Gln Thr Leu Gln Glu Asp Leu Leu Arg Arg Phe
370 375 380

Pro Asp Leu Asn Arg Leu Ala Lys Lys Phe Gln Arg Gln Ala Ala Asn
385 390 395 400

Leu Gln Asp Cys Tyr Arg Leu Tyr Gln Gly Ile Asn Gln Leu Pro Asn
405 410 415

Val Ile Gln Ala Leu Glu Lys His Glu Gly Lys His Gln Lys Leu Leu
420 425 430

Leu Ala Val Phe Val Thr Pro Leu Thr Asp Leu Arg Ser Asp Phe Ser
435 440 445

Lys Phe Gln Glu Met Ile Glu Thr Thr Leu Asp Met Asp Gln Val Glu
450 455 460

Asn His Glu Phe Leu Val Lys Pro Ser Phe Asp Pro Asn Leu Ser Glu
465 470 475 480

Leu Arg Glu Ile Met Asn Asp Leu Glu Lys Lys Met Gln Ser Thr Leu
485 490 495

Ile Ser Ala Ala Arg Asp Leu Gly Leu Asp Pro Gly Lys Gln Ile Lys
500 505 510

Leu Asp Ser Ser Ala Gln Phe Gly Tyr Tyr Phe Arg Val Thr Cys Lys
515 520 525

Glu Glu Lys Val Leu Arg Asn Asn Lys Asn Phe Ser Thr Val Asp Ile
530 535 540

Gln Lys Asn Gly Val Lys Phe Thr Asn Ser Lys Leu Thr Ser Leu Asn
545 550 555 560

Glu Glu Tyr Thr Lys Asn Lys Thr Glu Tyr Glu Glu Ala Gln Asp Ala
565 570 575

Ile Val Lys Glu Ile Val Asn Ile Ser Ser Gly Tyr Val Glu Pro Met
580 585 590

Gln Thr Leu Asn Asp Val Leu Ala Gln Leu Asp Ala Val Val Ser Phe
595 600 605

Ala His Val Ser Asn Gly Ala Pro Val Pro Tyr Val Arg Pro Ala Ile
610 615 620

Leu Glu Lys Gly Gln Gly Arg Ile Ile Leu Lys Ala Ser Arg His Ala
625 630 635 640

Cys Val Glu Val Gln Asp Glu Ile Ala Phe Ile Pro Asn Asp Val Tyr
645 650 655

Phe Glu Lys Asp Lys Gln Met Phe His Ile Ile Thr Gly Pro Asn Met
660 665 670

Gly Gly Lys Ser Thr Tyr Ile Arg Gln Thr Gly Val Ile Val Leu Met
675 680 685

Ala Gln Ile Gly Cys Phe Val Pro Cys Glu Ser Ala Glu Val Ser Ile
690 695 700

Val Asp Cys Ile Leu Ala Arg Val Gly Ala Gly Asp Ser Gln Leu Lys
705 710 715 720

Gly Val Ser Thr Phe Met Ala Glu Met Leu Glu Thr Ala Ser Ile Leu
725 730 735

Arg Ser Ala Thr Lys Asp Ser Leu Ile Ile Ile Asp Glu Leu Gly Arg
740 745 750

Gly Thr Ser Thr Tyr Asp Gly Phe Gly Leu Ala Trp Ala Ile Ser Glu
755 760 765

Tyr Ile Ala Thr Lys Ile Gly Ala Phe Cys Met Phe Ala Thr His Phe
770 775 780

His Glu Leu Thr Ala Leu Ala Asn Gln Ile Pro Thr Val Asn Asn Leu
785 790 795 800

His Val Thr Ala Leu Thr Thr Glu Glu Thr Leu Thr Met Leu Tyr Gln
805 810 815

Val Lys Lys Gly Val Cys Asp Gln Ser Phe Gly Ile His Val Ala Glu
820 825 830

Leu Ala Asn Phe Pro Lys His Val Ile Glu Cys Ala Lys Gln Lys Ala
835 840 845

Leu Glu Leu Glu Glu Phe Gln Tyr Ile Gly Glu Ser Gln Gly Tyr Asp
850 855 860

Ile Met Glu Pro Ala Ala Lys Lys Cys Tyr Leu Glu Arg Glu Gln Gly
865 870 875 880

Glu Lys Ile Ile Gln Glu Phe Leu Ser Lys Val Lys Gln Met Pro Phe
885 890 895

Thr Glu Met Ser Glu Glu Asn Ile Thr Ile Lys Leu Lys Gln Leu Lys
900 905 910

Ala Glu Val Ile Ala Lys Asn Asn Ser Phe Val Asn Glu Ile Ile Ser
915 920 925

Arg Ile Lys Val Thr Thr
930



<210> 21 
<211> 3145 
<212> DNA 

<213> Homo sapiens 
<400> 21 

ggcgggaaac agcttagtgg gtgtggggtc gcgcattttc ttcaaccagg aggtgaggag 60
gtttcgacat ggcggtgcag ccgaaggaga cgctgcagtt ggagagcgcg gccgaggtcg 120
gcttcgtgcg cttctttcag ggcatgccgg agaagccgac caccacagtg cgccttttcg 180
accggggcga cttctatacg gcgcacggcg aggacgcgct gctggccgcc cgggaggtgt 240
tcaagaccca gggggtgatc aagtacatgg ggccggcagg agcaaagaat ctgcagagtg 300
ttgtgcttag taaaatgaat tttgaatctt ttgtaaaaga tcttcttctg gttcgtcagt 360
atagagttga agtttataag aatagagctg gaaataaggc atccaaggag aatgattggt 420
atttggcata taaggcttct cctggcaatc tctctcagtt tgaagacatt ctctttggta 480
acaatgatat gtcagcttcc attggtgttg tgggtgttaa aatgtccgca gttgatggcc 540
agagacaggt tggagttggg tatgtggatt ccatacagag gaaactagga ctgtgtgaat 600
tccctgataa tgatcagttc tccaatcttg aggctctcct catccagatt ggaccaaagg 660
aatgtgtttt acccggagga gagactgctg gagacatggg gaaactgaga cagataattc 720
aaagaggagg aattctgatc acagaaagaa aaaaagctga cttttccaca aaagacattt 780
atcaggacct caaccggttg ttgaaaggca aaaagggaga gcagatgaat agtgctgtat 840
tgccagaaat ggagaatcag gttgcagttt catcactgtc tgcggtaatc aagtttttag 900
aactcttatc agatgattcc aactttggac agtttgaact gactactttt gacttcagcc 960
agtatatgaa attggatatt gcagcagtca gagcccttaa cctttttcag ggttctgttg 1020
aagataccac tggctctcag tctctggctg ccttgctgaa taagtgtaaa acccctcaag 1080
gacaaagact tgttaaccag tggattaagc agcctctcat ggataagaac agaatagagg 1140
agagattgaa tttagtggaa gcttttgtag aagatgcaga attgaggcag actttacaag 1200
aagatttact tcgtcgattc ccagatctta accgacttgc caagaagttt caaagacaag 1260
cagcaaactt acaagattgt taccgactct atcagggtat aaatcaacta cctaatgtta 1320
tacaggctct ggaaaaacat gaaggaaaac accagaaatt attgttggca gtttttgtga 1380
ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt 1440
tagatatgga tcaggtggaa aaccatgaat tccttgtaaa accttcattt gatcctaatc 1500
tcagtgaatt aagagaaata atgaatgact tggaaaagaa gatgcagtca acattaataa 1560
gtgcagccag agatcttggc ttggaccctg gcaaacagat taaactggat tccagtgcac 1620
agtttggata ttactttcgt gtaacctgta aggaagaaaa agtccttcgt aacaataaaa 1680
actttagtac tgtagatatc cagaagaatg gtgttaaatt taccaacagc aaattgactt 1740
ctttaaatga agagtatacc aaaaataaaa cagaatatga agaagcccag gatgccattg 1800
ttaaagaaat tgtcaatatt tcttcaggct atgtagaacc aatgcagaca ctcaatgatg 1860
tgttagctca gctagatgct gttgtcagct ttgctcacgt gtcaaatgga gcacctgttc 1920
catatgtacg accagccatt ttggagaaag gacaaggaag aattatatta aaagcatcca 1980
ggcatgcttg tgttgaagtt caagatgaaa ttgcatttat tcctaatgac gtatactttg 2040
aaaaagataa acagatgttc cacatcatta ctggccccaa tatgggaggt aaatcaacat 2100
atattcgaca aactggggtg atagtactca tggcccaaat tgggtgtttt gtgccatgtg 2160
agtcagcaga agtgtccatt gtggactgca tcttagcccg agtaggggct ggtgacagtc 2220
aattgaaagg agtctccacg ttcatggctg aaatgttgga aactgcttct atcctcaggt 2280
ctgcaaccaa agattcatta ataatcatag atgaattggg aagaggaact tctacctacg 2340
atggatttgg gttagcatgg gctatatcag aatacattgc aacaaagatt ggtgcttttt 2400
gcatgtttgc aacccatttt catgaactta ctgccttggc caatcagata ccaactgtta 2460
ataatctaca tgtcacagca ctcaccactg aagagacctt aactatgctt tatcaggtga 2520
agaaaggtgt ctgtgatcaa agttttggga ttcatgttgc agagcttgct aatttcccta 2580
agcatgtaat agagtgtgct aaacagaaag ccctggaact tgaggagttt cagtatattg 2640
gagaatcgca aggatatgat atcatggaac cagcagcaaa gaagtgctat ctggaaagag 2700
agcaaggtga aaaaattatt caggagttcc tgtccaaggt gaaacaaatg ccctttactg 2760
aaatgtcaga agaaaacatc acaataaagt taaaacagct aaaagctgaa gtaatagcaa 2820
agaataatag ctttgtaaat gaaatcattt cacgaataaa agttactacg tgaaaaatcc 2880
cagtaatgga atgaaggtaa tattgataag ctattgtctg taatagtttt atattgtttt 2940
atattaaccc tttttccata gtgttaactg tcagtgccca tgggctatca acttaataag 3000
atatttagta atattttact ttgaggacat tttcaaagat ttttattttg aaaaatgaga 3060
gctgtaactg aggactgttt gcaattgaca taggcaataa taagtgatgt gctgaatttt 3120
ataaataaaa tcatgtagtt tgtgg 3145



<210> 22 
<211> 756 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Met Ser Phe Val Ala Gly Val Ile Arg Arg Leu Asp Glu Thr Val Val
1 5 10 15

Asn Arg Ile Ala Ala Gly Glu Val Ile Gln Arg Pro Ala Asn Ala Ile
20 25 30

Lys Glu Met Ile Glu Asn Cys Leu Asp Ala Lys Ser Thr Ser Ile Gln
35 40 45

Val Ile Val Lys Glu Gly Gly Leu Lys Leu Ile Gln Ile Gln Asp Asn
50 55 60

Gly Thr Gly Ile Arg Lys Glu Asp Leu Asp Ile Val Cys Glu Arg Phe
65 70 75 80

Thr Thr Ser Lys Leu Gln Ser Phe Glu Asp Leu Ala Ser Ile Ser Thr
85 90 95

Tyr Gly Phe Arg Gly Glu Ala Leu Ala Ser Ile Ser His Val Ala His
100 105 110

Val Thr Ile Thr Thr Lys Thr Ala Asp Gly Lys Cys Ala Tyr Arg Ala
115 120 125

Ser Tyr Ser Asp Gly Lys Leu Lys Ala Pro Pro Lys Pro Cys Ala Gly
130 135 140

Asn Gln Gly Thr Gln Ile Thr Val Glu Asp Leu Phe Tyr Asn Ile Ala
145 150 155 160

Thr Arg Arg Lys Ala Leu Lys Asn Pro Ser Glu Glu Tyr Gly Lys Ile
165 170 175

Leu Glu Val Val Gly Arg Tyr Ser Val His Asn Ala Gly Ile Ser Phe
180 185 190

Ser Val Lys Lys Gln Gly Glu Thr Val Ala Asp Val Arg Thr Leu Pro
195 200 205

Asn Ala Ser Thr Val Asp Asn Ile Arg Ser Ile Phe Gly Asn Ala Val
210 215 220

Ser Arg Glu Leu Ile Glu Ile Gly Cys Glu Asp Lys Thr Leu Ala Phe
225 230 235 240

Lys Met Asn Gly Tyr Ile Ser Asn Ala Asn Tyr Ser Val Lys Lys Cys
245 250 255

Ile Phe Leu Leu Phe Ile Asn His Arg Leu Val Glu Ser Thr Ser Leu
260 265 270

Arg Lys Ala Ile Glu Thr Val Tyr Ala Ala Tyr Leu Pro Lys Asn Thr
275 280 285

His Pro Phe Leu Tyr Leu Ser Leu Glu Ile Ser Pro Gln Asn Val Asp
290 295 300

Val Asn Val His Pro Thr Lys His Glu Val His Phe Leu His Glu Glu
305 310 315 320

Ser Ile Leu Glu Arg Val Gln Gln His Ile Glu Ser Lys Leu Leu Gly
325 330 335

Ser Asn Ser Ser Arg Met Tyr Phe Thr Gln Thr Leu Leu Pro Gly Leu
340 345 350

Ala Gly Pro Ser Gly Glu Met Val Lys Ser Thr Thr Ser Leu Thr Ser
355 360 365

Ser Ser Thr Ser Gly Ser Ser Asp Lys Val Tyr Ala His Gln Met Val
370 375 380

Arg Thr Asp Ser Arg Glu Gln Lys Leu Asp Ala Phe Leu Gln Pro Leu
385 390 395 400

Ser Lys Pro Leu Ser Ser Gln Pro Gln Ala Ile Val Thr Glu Asp Lys
405 410 415

Thr Asp Ile Ser Ser Gly Arg Ala Arg Gln Gln Asp Glu Glu Met Leu
420 425 430

Glu Leu Pro Ala Pro Ala Glu Val Ala Ala Lys Asn Gln Ser Leu Glu
435 440 445

Gly Asp Thr Thr Lys Gly Thr Ser Glu Met Ser Glu Lys Arg Gly Pro
450 455 460

Thr Ser Ser Asn Pro Arg Lys Arg His Arg Glu Asp Ser Asp Val Glu
465 470 475 480

Met Val Glu Asp Asp Ser Arg Lys Glu Met Thr Ala Ala Cys Thr Pro
485 490 495

Arg Arg Arg Ile Ile Asn Leu Thr Ser Val Leu Ser Leu Gln Glu Glu
500 505 510

Ile Asn Glu Gln Gly His Glu Val Leu Arg Glu Met Leu His Asn His
515 520 525

Ser Phe Val Gly Cys Val Asn Pro Gln Trp Ala Leu Ala Gln His Gln
530 535 540

Thr Lys Leu Tyr Leu Leu Asn Thr Thr Lys Leu Ser Glu Glu Leu Phe
545 550 555 560

Tyr Gln Ile Leu Ile Tyr Asp Phe Ala Asn Phe Gly Val Leu Arg Leu
565 570 575

Ser Glu Pro Ala Pro Leu Phe Asp Leu Ala Met Leu Ala Leu Asp Ser
580 585 590

Pro Glu Ser Gly Trp Thr Glu Glu Asp Gly Pro Lys Glu Gly Leu Ala
595 600 605

Glu Tyr Ile Val Glu Phe Leu Lys Lys Lys Ala Glu Met Leu Ala Asp
610 615 620

Tyr Phe Ser Leu Glu Ile Asp Glu Glu Gly Asn Leu Ile Gly Leu Pro
625 630 635 640

Leu Leu Ile Asp Asn Tyr Val Pro Pro Leu Glu Gly Leu Pro Ile Phe
645 650 655

Ile Leu Arg Leu Ala Thr Glu Val Asn Trp Asp Glu Glu Lys Glu Cys
660 665 670

Phe Glu Ser Leu Ser Lys Glu Cys Ala Met Phe Tyr Ser Ile Arg Lys
675 680 685

Gln Tyr Ile Ser Glu Glu Ser Thr Leu Ser Gly Gln Gln Ser Glu Val
690 695 700

Pro Gly Ser Ile Pro Asn Ser Trp Lys Trp Thr Val Glu His Ile Val
705 710 715 720

Tyr Lys Ala Leu Arg Ser His Ile Leu Pro Pro Lys His Phe Thr Glu
725 730 735

Asp Gly Asn Ile Leu Gln Leu Ala Asn Leu Pro Asp Leu Tyr Lys Val
740 745 750

Phe Glu Arg Cys
755



<210> 23 

<211> 2484 

<212> DNA 

<213> Homo sapiens 



< 400> 23 ^rcattc gtggcagggg ttattcggcg gctggacgag 60 

cttggotctt ctggcgccaa aatgtcgttc gtgg gctaa tgctatcaaa 120 

acagtggtga accgoatcgo ggcgggggaa Scaagtgat tgttaaagag 180 

gagatgattg agaactgttt agatgcaaa tccaC a g ^ ^ 40 

ggaggcctga agttgattca gatccaagac JJ« cct ttgagga tttagcoagt 300 

gatattgtat gtgaaaggtt cactactagt aaactg g ^ ggctcatgtt 360 

atttctacct atggctttcg aggtgaggct "W^* tta ct cagatgga 420 

fctattacaa cgaaaacagc tgatggaaag ^ cacggtgga g 48 

aaactgaaag ccootcctaa aocatgtgct W« aaaa tocaag tgaagaatat 540 

ess sss === =b= =< = 
5= "= sss &. — — t9t 720
gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg 780
aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga 840
aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac 900
ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa 960
gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag 1020
ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct 1080
ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga 1140
agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt 1200
gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca 1260
gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa 1320
ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag 1380
gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat 1440
cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct 1500
tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt 1560
aatgagcagg gacatgaggt tctccgggag atgttgcata accactcctt cgtgggctgt 1620
gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc 1680
aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt 1740
ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca 1800
gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag 1860
tttctgaaga agaaggctga gatgcttgca gactatttct ctttggaaat tgatgaggaa 1920
gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg 1980
cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt 2040
gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag 2100
gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag 2160
tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat 2220
ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt 2280
gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc 2340
cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag 2400
cacttaagac ttatacttgc cttctgatag tattccttta tacacagtgg attgattata 2460
aataaataga tgtgtcttaa cata 2484



<210> 24 
<211> 133 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Met Lys Gln Leu Pro Ala Ala Thr Val Arg Leu Leu Ser Ser Ser Gln
1 5 10 15

Ile Ile Thr Ser Val Val Ser Val Val Lys Glu Leu Ile Glu Asn Ser
20 25 30

Leu Asp Ala Gly Ala Thr Ser Val Asp Val Lys Leu Glu Asn Tyr Gly
35 40 45

Phe Asp Lys Ile Glu Val Arg Asp Asn Gly Glu Gly Ile Lys Ala Val
50 55 60

Asp Ala Pro Val Met Ala Met Lys Tyr Tyr Thr Ser Lys Ile Asn Ser
65 70 75 80

His Glu Asp Leu Glu Asn Leu Thr Thr Tyr Gly Phe Arg Gly Glu Ala
85 90 95

Leu Gly Ser Ile Cys Cys Ile Ala Glu Val Leu Ile Thr Thr Arg Thr
100 105 110

Ala Ala Asp Asn Phe Ser Thr Gln Tyr Val Leu Asp Gly Ser Gly His
115 120 125

Ile Leu Ser Gln Lys
130



<210> 25
<211> 426
<212> DNA

<213> Homo sapiens
<400> 25

cgaggcggat cgggtgttgc atccatggag cgagctgaga gctcgagtac agaacctgct 60
aaggccatca aacctattga tcggaagtca gtccatcaga tttgctctgg gcaggtggta 120
ctgagtctaa gcactgcggt aaaggagtta gtagaaaaca gtctggatgc tggtgccact 180
aatattgatc taaagcttaa ggactatgga gtggatctta ttgaagtttc agacaatgga 240
tgtggggtag aagaagaaaa cttcgaaggc ttaactctga aacatcacac atctaagatt 300
caagagtttg ccgacctaac tcaggttgaa acttttggct ttcgggggga agctctgagc 360
tcactttgtg cactgagcga tgtcaccatt tctacctgcc acgcatcggc gaaggttgga 420
acttga 426



<210> 26 
<211> 1360 
<212> PRT 

<213> Homo sapiens 



<400> 26

Met Ser Arg Gln Ser Thr Leu Tyr Ser Phe Phe Pro Lys Ser Pro Ala
1 5 10 15

Leu Ser Asp Ala Asn Lys Ala Ser Ala Arg Ala Ser Arg Glu Gly Gly
20 25 30

Arg Ala Ala Ala Ala Pro Gly Ala Ser Pro Ser Pro Gly Gly Asp Ala
35 40 45

Ala Trp Ser Glu Ala Gly Pro Gly Pro Arg Pro Leu Ala Arg Ser Ala
50 55 60

Ser Pro Pro Lys Ala Lys Asn Leu Asn Gly Gly Leu Arg Arg Ser Val
65 70 75 80

Ala Pro Ala Ala Pro Thr Ser Cys Asp Phe Ser Pro Gly Asp Leu Val
85 90 95

Trp Ala Lys Met Glu Gly Tyr Pro Trp Trp Pro Cys Leu Val Tyr Asn
100 105 110

His Pro Phe Asp Gly Thr Phe Ile Arg Glu Lys Gly Lys Ser Val Arg
115 120 125

Val His Val Gln Phe Phe Asp Asp Ser Pro Thr Arg Gly Trp Val Ser
130 135 140

Lys Arg Leu Leu Lys Pro Tyr Thr Gly Ser Lys Ser Lys Glu Ala Gln
145 150 155 160

Lys Gly Gly His Phe Tyr Ser Ala Lys Pro Glu Ile Leu Arg Ala Met
165 170 175

Gln Arg Ala Asp Glu Ala Leu Asn Lys Asp Lys Ile Lys Arg Leu Glu
180 185 190

Leu Ala Val Cys Asp Glu Pro Ser Glu Pro Glu Glu Glu Glu Glu Met
195 200 205

Glu Val Gly Thr Thr Tyr Val Thr Asp Lys Ser Glu Glu Asp Asn Glu
210 215 220

Ile Glu Ser Glu Glu Glu Val Gln Pro Lys Thr Gln Gly Ser Arg Arg
225 230 235 240

Ser Ser Arg Gln Ile Lys Lys Arg Arg Val Ile Ser Asp Ser Glu Ser
245 250 255

Asp Ile Gly Gly Ser Asp Val Glu Phe Lys Pro Asp Thr Lys Glu Glu
260 265 270

Gly Ser Ser Asp Glu Ile Ser Ser Gly Val Gly Asp Ser Glu Ser Glu
275 280 285

Gly Leu Asn Ser Pro Val Lys Val Ala Arg Lys Arg Lys Arg Met Val
290 295 300

Thr Gly Asn Gly Ser Leu Lys Arg Lys Ser Ser Arg Lys Glu Thr Pro
305 310 315 320

Ser Ala Thr Lys Gln Ala Thr Ser Ile Ser Ser Glu Thr Lys Asn Thr
325 330 335

Leu Arg Ala Phe Ser Ala Pro Gln Asn Ser Glu Ser Gln Ala His Val
340 345 350

Ser Gly Gly Gly Asp Asp Ser Ser Arg Pro Thr Val Trp Tyr His Glu
355 360 365

Thr Leu Glu Trp Leu Lys Glu Glu Lys Arg Arg Asp Glu His Arg Arg
370 375 380

Arg Pro Asp His Pro Asp Phe Asp Ala Ser Thr Leu Tyr Val Pro Glu
385 390 395 400

Asp Phe Leu Asn Ser Cys Thr Pro Gly Met Arg Lys Trp Trp Gln Ile
405 410 415

Lys Ser Gln Asn Phe Asp Leu Val Ile Cys Tyr Lys Val Gly Lys Phe
420 425 430

Tyr Glu Leu Tyr His Met Asp Ala Leu Ile Gly Val Ser Glu Leu Gly
435 440 445

Leu Val Phe Met Lys Gly Asn Trp Ala His Ser Gly Phe Pro Glu Ile
450 455 460

Ala Phe Gly Arg Tyr Ser Asp Ser Leu Val Gln Lys Gly Tyr Lys Val
465 470 475 480

Ala Arg Val Glu Gln Thr Glu Thr Pro Glu Met Met Glu Ala Arg Cys
485 490 495

Arg Lys Met Ala His Ile Ser Lys Tyr Asp Arg Val Val Arg Arg Glu
500 505 510

Ile Cys Arg Ile Ile Thr Lys Gly Thr Gln Thr Tyr Ser Val Leu Glu
515 520 525

Gly Asp Pro Ser Glu Asn Tyr Ser Lys Tyr Leu Leu Ser Leu Lys Glu
530 535 540

Lys Glu Glu Asp Ser Ser Gly His Thr Arg Ala Tyr Gly Val Cys Phe
545 550 555 560

31 
Val Asp Thr Ser Leu Gly Lys Phe Phe Ile Gly Gln Phe Ser Asp Asp
565 570 575

Arg His Cys Ser Arg Phe Arg Thr Leu Val Ala His Tyr Pro Pro Val
580 585 590

Gln Val Leu Phe Glu Lys Gly Asn Leu Ser Lys Glu Thr Lys Thr Ile
595 600 605

Leu Lys Ser Ser Leu Ser Cys Ser Leu Gln Glu Gly Leu Ile Pro Gly
610 615 620

Ser Gln Phe Trp Asp Ala Ser Lys Thr Leu Arg Thr Leu Leu Glu Glu
625 630 635 640

Glu Tyr Phe Arg Glu Lys Leu Ser Asp Gly Ile Gly Val Met Leu Pro
645 650 655

Gln Val Leu Lys Gly Met Thr Ser Glu Ser Asp Ser Ile Gly Leu Thr
660 665 670

Pro Gly Glu Lys Ser Glu Leu Ala Leu Ser Ala Leu Gly Gly Cys Val
675 680 685

Phe Tyr Leu Lys Lys Cys Leu Ile Asp Gln Glu Leu Leu Ser Met Ala
690 695 700

Asn Phe Glu Glu Tyr Ile Pro Leu Asp Ser Asp Thr Val Ser Thr Thr
705 710 715 720

Arg Ser Gly Ala Ile Phe Thr Lys Ala Tyr Gln Arg Met Val Leu Asp
725 730 735

Ala Val Thr Leu Asn Asn Leu Glu Ile Phe Leu Asn Gly Thr Asn Gly
740 745 750

Ser Thr Glu Gly Thr Leu Leu Glu Arg Val Asp Thr Cys His Thr Pro
755 760 765

Phe Gly Lys Arg Leu Leu Lys Gln Trp Leu Cys Ala Pro Leu Cys Asn
770 775 780

His Tyr Ala Ile Asn Asp Arg Leu Asp Ala Ile Glu Asp Leu Met Val
785 790 795 800

Val Pro Asp Lys Ile Ser Glu Val Val Glu Leu Leu Lys Lys Leu Pro
805 810 815
Asp Leu Glu Arg Leu Leu Ser Lys Ile His Asn Val Gly Ser Pro Leu
820 825

Lys Ser Gln Asn His Pro Asp Ser Arg Ala Ile Met Tyr Glu Glu Thr
835 840

Thr Tyr Ser Lys Lys Lys Ile Ile Asp Phe Leu Ser Ala Leu Glu Gly
850 855

Phe Lys Val Met Cys Lys Ile Ile Gly Ile Met Glu Glu Val Ala Asp
865 870

Gly Phe Lys Ser Lys Ile Leu Lys Gln Val Ile Ser Leu Gln Thr Lys
885 890

Asn Pro Glu Gly Arg Phe Pro Asp Leu Thr Val Glu Leu Asn Arg Trp
900 905

Asp Thr Ala Phe Asp His Glu Lys Ala Arg Lys Thr Gly Leu Ile Thr
915 920

Pro Lys Ala Gly Phe Asp Ser Asp Tyr Asp Gln Ala Leu Ala Asp Ile
930 935

Arg Glu Asn Glu Gln Ser Leu Leu Glu Tyr Leu Glu Lys Gln Arg Asn
945 950

Arg Ile Gly Cys Arg Thr Ile Val Tyr Trp Gly Ile Gly Arg Asn Arg
965 970

Tyr Gln Leu Glu Ile Pro Glu Asn Phe Thr Thr Arg Asn Leu Pro Glu
980 985 990

Glu Tyr Glu Leu Lys Ser Thr Lys Lys Gly Cys Lys Arg Tyr Trp Thr
995 1000

Lys Thr Ile Glu Lys Lys Leu Ala Asn Leu Ile Asn Ala Glu Glu Arg
1010 1015

Arg Asp Val Ser Leu Lys Asp Cys Met Arg Arg Leu Phe Tyr Asn Phe
1025 1030

Asp Lys Asn Tyr Lys Asp Trp Gln Ser Ala Val Glu Cys Ile Ala Val
1045 1050

Leu Asp Val Leu Leu Cys Leu Ala Asn Tyr Ser Arg Gly Gly Asp Gly
1060 1065
Pro Met Cys Arg Pro Val Ile Leu Leu Pro Glu Asp Thr Pro Pro Phe
1075 1080

Leu Glu Leu Lys Gly Ser Arg His Pro Cys Ile Thr Lys Thr Phe Phe
1090 1095

Gly Asp Asp Phe Ile Pro Asn Asp Ile Leu Ile Gly Cys Glu Glu Glu
1105 1110 1120

Glu Gln Glu Asn Gly Lys Ala Tyr Cys Val Leu Val Thr Gly Pro Asn
1125 1135

Met Gly Gly Lys Ser Thr Leu Met Arg Gln Ala Gly Leu Leu Ala Val
1140 1145

Met Ala Gln Met Gly Cys Tyr Val Pro Ala Glu Val Cys Arg Leu Thr
1155 1160

Pro Ile Asp Arg Val Phe Thr Arg Leu Gly Ala Ser Asp Arg Ile Met
1170 1175

Ser Gly Glu Ser Thr Phe Phe Val Glu Leu Ser Glu Thr Ala Ser Ile
1185 1190

Leu Met His Ala Thr Ala His Ser Leu Val Leu Val Asp Glu Leu Gly
1205 1210

Arg Gly Thr Ala Thr Phe Asp Gly Thr Ala Ile Ala Asn Ala Val Val
1220 1225

Lys Glu Leu Ala Glu Thr Ile Lys Cys Arg Thr Leu Phe Ser Thr His
1235 1240

Tyr His Ser Leu Val Glu Asp Tyr Ser Gln Asn Val Ala Val Arg Leu
1250 1255

Gly His Met Ala Cys Met Val Glu Asn Glu Cys Glu Asp Pro Ser Gln
1265 1270 1275

Glu Thr Ile Thr Phe Leu Tyr Lys Phe Ile Lys Gly Ala Cys Pro Lys
1285 1290

Ser Tyr Gly Phe Asn Ala Ala Arg Leu Ala Asn Leu Pro Glu Glu Val
1300 1305

Ile Gln Lys Gly His Arg Lys Ala Arg Glu Phe Glu Lys Met Asn Gln
1315 1320
Ser Leu Arg Leu Phe Arg Glu Val Cys Leu Ala Ser Glu Arg Ser Thr
1330 1335 1340

Val Asp Ala Glu Ala Val His Lys Leu Leu Thr Leu Ile Lys Glu Leu
1345 1350 1355 1360 



<210> 27 
<211> 4244 
<212> DNA 

<213> Homo sapiens 
<400> 27 

gccgcgcggt agatgcggtg cttttaggag ctccgtccga cagaacggtt gggccttgcc 60
ggctgtcggt atgtcgcgac agagcaccct gtacagcttc ttccccaagt ctccggcgct 120
gagtgatgcc aacaaggcct cggccagggc ctcacgcgaa ggcggccgtg ccgccgctgc 180
ccccggggcc tctccttccc caggcgggga tgcggcctgg agcgaggctg ggcctgggcc 240
caggcccttg gcgcgctccg cgtcaccgcc caaggcgaag aacctcaacg gagggctgcg 300
gagatcggta gcgcctgctg cccccaccag ttgtgacttc tcaccaggag atttggtttg 360
ggccaagatg gagggttacc cctggtggcc ttgtctggtt tacaaccacc cctttgatgg 420
aacattcatc cgcgagaaag ggaaatcagt ccgtgttcat gtacagtttt ttgatgacag 480
cccaacaagg ggctgggtta gcaaaaggct tttaaagcca tatacaggtt caaaatcaaa 540
ggaagcccag aagggaggtc atttttacag tgcaaagcct gaaatactga gagcaatgca 600
acgtgcagat gaagccttaa ataaagacaa gattaagagg cttgaattgg cagtttgtga 660
tgagccctca gagccagaag aggaagaaga gatggaggta ggcacaactt acgtaacaga 720
taagagtgaa gaagataatg aaattgagag tgaagaggaa gtacagccta agacacaagg 780
atctaggcga agtagccgcc aaataaaaaa acgaagggtc atatcagatt ctgagagtga 840
cattggtggc tctgatgtgg aatttaagcc agacactaag gaggaaggaa gcagtgatga 900
aataagcagt ggagtggggg atagtgagag tgaaggcctg aacagccctg tcaaagttgc 960
tcgaaagcgg aagagaatgg tgactggaaa tggctctctt aaaaggaaaa gctctaggaa 1020
ggaaacgccc tcagccacca aacaagcaac tagcatttca tcagaaacca agaatacttt 1080
gagagctttc tctgcccctc aaaattctga atcccaagcc cacgttagtg gaggtggtga 1140
tgacagtagt cgccctactg tttggtatca tgaaacttta gaatggctta aggaggaaaa 1200
gagaagagat gagcacagga ggaggcctga tcaccccgat tttgatgcat ctacactcta 1260
tgtgcctgag gatttcctca attcttgtac tcctgggatg aggaagtggt ggcagattaa 1320
gtctcagaac tttgatcttg tcatctgtta caaggtgggg aaattttatg agctgtacca 1380
catggatgct cttattggag tcagtgaact ggggctggta ttcatgaaag gcaactgggc 1440
ccattctggc tttcctgaaa ttgcatttgg ccgttattca gattccctgg tgcagaaggg 1500
ctataaagta gcacgagtgg aacagactga gactccagaa atgatggagg cacgatgtag 1560
aaagatggca catatatcca agtatgatag agtggtgagg agggagatct gtaggatcat 1620
taccaagggt acacagactt acagtgtgct ggaaggtgat ccctctgaga actacagtaa 1680
gtatcttctt agcctcaaag aaaaagagga agattcttct ggccatactc gtgcatatgg 1740
tgtgtgcttt gttgatactt cactgggaaa gtttttcata ggtcagtttt cagatgatcg 1800
ccattgttcg agatttagga ctctagtggc acactatccc ccagtacaag ttttatttga 1860
aaaaggaaat ctctcaaagg aaactaaaac aattctaaag agttcattgt cctgttctct 1920
tcaggaaggt ctgatacccg gctcccagtt ttgggatgca tccaaaactt tgagaactct 1980
ccttgaggaa gaatatttta gggaaaagct aagtgatggc attggggtga tgttacccca 2040
ggtgcttaaa ggtatgactt cagagtctga ttccattggg ttgacaccag gagagaaaag 2100
tgaattggcc ctctctgctc taggtggttg tgtcttctac ctcaaaaaat gccttattga 2160
tcaggagctt ttatcaatgg ctaattttga agaatatatt cccttggatt ctgacacagt 2220
cagcactaca agatctggtg ctatcttcac caaagcctat caacgaatgg tgctagatgc 2280
agtgacatta aacaacttgg agatttttct gaatggaaca aatggttcta ctgaaggaac 2340
cctactagag agggttgata cttgccatac tccttttggt aagcggctcc taaagcaatg 2400
gctttgtgcc ccactctgta accattatgc tattaatgat cgtctagatg ccatagaaga 2460
cctcatggtt gtgcctgaca aaatctccga agttgtagag cttctaaaga agcttccaga 2520
tcttgagagg ctactcagta aaattcataa tgttgggtct cccctgaaga gtcagaacca 2580
cccagacagc agggctataa tgtatgaaga aactacatac agcaagaaga agattattga 2640
ttttctttct gctctggaag gattcaaagt aatgtgtaaa attataggga tcatggaaga 2700
agttgctgat ggttttaagt ctaaaatcct taagcaggtc atctctctgc agacaaaaaa 2760
tcctgaaggt cgttttcctg atttgactgt agaattgaac cgatgggata cagcctttga 2820
ccatgaaaag gctcgaaaga ctggacttat tactcccaaa gcaggctttg actctgatta 2880
tgaccaagct cttgctgaca taagagaaaa tgaacagagc ctcctggaat acctagagaa 2940
acagcgcaac agaattggct gtaggaccat agtctattgg gggattggta ggaaccgtta 3000
ccagctggaa attcctgaga atttcaccac tcgcaatttg ccagaagaat acgagttgaa 3060
atctaccaag aagggctgta aacgatactg gaccaaaact attgaaaaga agttggctaa 3120
tctcataaat gctgaagaac ggagggatgt atcattgaag gactgcatgc ggcgactgtt 3180
ctataacttt gataaaaatt acaaggactg gcagtctgct gtagagtgta tcgcagtgtt 3240
ggatgtttta ctgtgcctgg ctaactatag tcgagggggt gatggtccta tgtgtcgccc 3300
agtaattctg ttgccggaag ataccccccc cttcttagag cttaaaggat cacgccatcc 3360
ttgcattacg aagacttttt ttggagatga ttttattcct aatgacattc taataggctg 3420
tgaggaagag gagcaggaaa atggcaaagc ctattgtgtg cttgttactg gaccaaatat 3480
ggggggcaag tctacgctta tgagacaggc tggcttatta gctgtaatgg cccagatggg 3540
ttgttacgtc cctgctgaag tgtgcaggct cacaccaatt gatagagtgt ttactagact 3600
tggtgcctca gacagaataa tgtcaggtga aagtacattt tttgttgaat taagtgaaac 3660
tgccagcata ctcatgcatg caacagcaca ttctctggtg cttgtggatg aattaggaag 3720
aggtactgca acatttgatg ggacggcaat agcaaatgca gttgttaaag aacttgctga 3780
gactataaaa tgtcgtacat tattttcaac tcactaccat tcattagtag aagattattc 3840
tcaaaatgtt gctgtgcgcc taggacatat ggcatgcatg gtagaaaatg aatgtgaaga 3900
ccccagccag gagactatta cgttcctcta taaattcatt aagggagctt gtcctaaaag 3960
ctatggcttt aatgcagcaa ggcttgctaa tctcccagag gaagttattc aaaagggaca 4020
tagaaaagca agagaatttg agaagatgaa tcagtcacta cgattatttc gggaagtttg 4080
cctggctagt gaaaggtcaa ctgtagatgc tgaagctgtc cataaattgc tgactttgat 4140
taaggaatta tagactgact acattggaag ctttgagttg acttctgaca aaggtggtaa 4200
attcagacaa cattatgatc taataaactt tattttttaa aaat 4244

<210> 28
<211> 1128
<212> PRT
<213> Homo sapiens
<400> 28
Met Ser Arg Arg Lys Pro Ala Ser Gly Gly Leu Ala Ala Ser Ser Ser
1 5 10 15

Ala Pro Ala Arg Gln Ala Val Leu Ser Arg Phe Phe Gln Ser Thr Gly
20 25 30

Ser Leu Lys Ser Thr Ser Ser Ser Thr Gly Ala Ala Asp Gln Val Asp
35 40 45

Pro Gly Ala Ala Ala Ala Ala Ala Pro Pro Ala Pro Ala Phe Pro Pro
50 55 60

Gln Leu Pro Pro His Val Ala Thr Glu Ile Asp Arg Arg Lys Lys Arg
65 70 75 80

Pro Leu Glu Asn Asp Gly Pro Val Lys Lys Lys Val Lys Lys Val Gln
85 90 95

Gln Lys Glu Gly Gly Ser Asp Leu Gly Met Ser Gly Asn Ser Glu Pro
100 105 110

Lys Lys Cys Leu Arg Thr Arg Asn Val Ser Lys Ser Leu Glu Lys Leu
115 120 125

Lys Glu Phe Cys Cys Asp Ser Ala Leu Pro Gln Ser Arg Val Gln Thr
130 135 140

Glu Ser Leu Gln Glu Arg Phe Ala Val Leu Pro Lys Cys Thr Asp Phe
145 150 155 160

Asp Asp Ile Ser Leu Leu His Ala Lys Asn Ala Val Ser Ser Glu Asp
165 170 175

Ser Lys Arg Gln Ile Asn Gln Lys Asp Thr Thr Leu Phe Asp Leu Ser
180 185 190

Gln Phe Gly Ser Ser Asn Thr Ser His Glu Asn Leu Gln Lys Thr Ala
195 200 205

Ser Lys Ser Ala Asn Lys Arg Ser Lys Ser Ile Tyr Thr Pro Leu Glu
210 215 220

Leu Gln Tyr Ile Glu Met Lys Gln Gln His Lys Asp Ala Val Leu Cys
225 230 235 240

Val Glu Cys Gly Tyr Lys Tyr Arg Phe Phe Gly Glu Asp Ala Glu Ile
245 250 255
Ala Ala Arg Glu Leu Asn Ile Tyr Cys His Leu Asp His Asn Phe Met
260 265

Thr Ala Ser Ile Pro Thr His Arg Leu Phe Val His Val Arg Arg Leu
275 280 285

Val Ala Lys Gly Tyr Lys Val Gly Val Val Lys Gln Thr Glu Thr Ala
290 295 300

Ala Leu Lys Ala Ile Gly Asp Asn Arg Ser Ser Leu Phe Ser Arg Lys
305 310 320



Leu Thr Ala Leu Tyr Thr Lys Ser Thr Leu Ile Gly Glu Asp Val Asn
325 330

Pro Leu Ile Lys Leu Asp Asp Ala Val Asn Val Asp Glu Ile Met Thr
340 345

Asp Thr Ser Thr Ser Tyr Leu Leu Cys Ile Ser Glu Asn Lys Glu Asn
355 360 365

Val Arg Asp Lys Lys Lys Gly Asn Ile Phe Ile Gly Ile Val Gly Val
370 375

Gln Pro Ala Thr Gly Glu Val Val Phe Asp Ser Phe Gln Asp Ser Ala
385 390 395

Ser Arg Ser Glu Leu Glu Thr Arg Met Ser Ser Leu Gln Pro Val Glu
405 410

Leu Leu Leu Pro Ser Ala Leu Ser Glu Gln Thr Glu Ala Leu Ile His
420 425

Arg Ala Thr Ser Val Ser Val Gln Asp Asp Arg Ile Arg Val Glu Arg
435 440 445

Met Asp Asn Ile Tyr Phe Glu Tyr Ser His Ala Phe Gln Ala Val Thr
450 455 460

Glu Phe Tyr Ala Lys Asp Thr Val Asp Ile Lys Gly Ser Gln Ile Ile
465 470 475

Ser Gly Ile Val Asn Leu Glu Lys Pro Val Ile Cys Ser Leu Ala Ala
485 490

Ile Ile Lys Tyr Leu Lys Glu Phe Asn Leu Glu Lys Met Leu Ser Lys
500 505
Pro Glu Asn Phe Lys Gln Leu Ser Ser Lys Met Glu Phe Met Thr Ile
515 520 525

Asn Gly Thr Thr Leu Arg Asn Leu Glu Ile Leu Gln Asn Gln Thr Asp
530 535 540

Met Lys Thr Lys Gly Ser Leu Leu Trp Val Leu Asp His Thr Lys Thr
545 550 555 560

Ser Phe Gly Arg Arg Lys Leu Lys Lys Trp Val Thr Gln Pro Leu Leu
565 570 575

Lys Leu Arg Glu Ile Asn Ala Arg Leu Asp Ala Val Ser Glu Val Leu
580 585 590

His Ser Glu Ser Ser Val Phe Gly Gln Ile Glu Asn His Leu Arg Lys
595 600 605

Leu Pro Asp Ile Glu Arg Gly Leu Cys Ser Ile Tyr His Lys Lys Cys
610 615 620

Ser Thr Gln Glu Phe Phe Leu Ile Val Lys Thr Leu Tyr His Leu Lys
625 630 635 640

Ser Glu Phe Gln Ala Ile Ile Pro Ala Val Asn Ser His Ile Gln Ser
645 650 655

Asp Leu Leu Arg Thr Val Ile Leu Glu Ile Pro Glu Leu Leu Ser Pro
660 665 670

Val Glu His Tyr Leu Lys Ile Leu Asn Glu Gln Ala Ala Lys Val Gly
675 680 685

Asp Lys Thr Glu Leu Phe Lys Asp Leu Ser Asp Phe Pro Leu Ile Lys
690 695 700

Lys Arg Lys Asp Glu Ile Gln Gly Val Ile Asp Glu Ile Arg Met His
705 710 715 720

Leu Gln Glu Ile Arg Lys Ile Leu Lys Asn Pro Ser Ala Gln Tyr Val
725 730 735

Thr Val Ser Gly Gln Glu Phe Met Ile Glu Ile Lys Asn Ser Ala Val
740 745 750

Ser Cys Ile Pro Thr Asp Trp Val Lys Val Gly Ser Thr Lys Ala Val
755 760 765
Ser Arg Phe His Ser Pro Phe Ile Val Glu Asn Tyr Arg His Leu Asn
770 775 780

Gln Leu Arg Glu Gln Leu Val Leu Asp Cys Ser Ala Glu Trp Leu Asp
785 790 795 800

Phe Leu Glu Lys Phe Ser Glu His Tyr His Ser Leu Cys Lys Ala Val
805 810 815

His His Leu Ala Thr Val Asp Cys Ile Phe Ser Leu Ala Lys Val Ala
820 825 830

Lys Gln Gly Asp Tyr Cys Arg Pro Thr Val Gln Glu Glu Arg Lys Ile
835 840 845

Val Ile Lys Asn Gly Arg His Pro Val Ile Asp Val Leu Leu Gly Glu
850 855 860

Gln Asp Gln Tyr Val Pro Asn Asn Thr Asp Leu Ser Glu Asp Ser Glu
865 870 875 880

Arg Val Met Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Ser Tyr
885 890 895

Ile Lys Gln Val Ala Leu Ile Thr Ile Met Ala Gln Ile Gly Ser Tyr
900 905 910

Val Pro Ala Glu Glu Ala Thr Ile Gly Ile Val Asp Gly Ile Phe Thr
915 920 925

Arg Met Gly Ala Ala Asp Asn Ile Tyr Lys Gly Arg Ser Thr Phe Met
930 935 940

Glu Glu Leu Thr Asp Thr Ala Glu Ile Ile Arg Lys Ala Thr Ser Gln
945 950 955 960

Ser Leu Val Ile Leu Asp Glu Leu Gly Arg Gly Thr Ser Thr His Asp
965 970 975

Gly Ile Ala Ile Ala Tyr Ala Thr Leu Glu Tyr Phe Ile Arg Asp Val
980 985 990

Lys Ser Leu Thr Leu Phe Val Thr His Tyr Pro Pro Val Cys Glu Leu
995 1000 1005

Glu Lys Asn Tyr Ser His Gln Val Gly Asn Tyr His Met Gly Phe Leu
1010 1015 1020






WO 02/054856 



PCT/US01/00934 



Val Ser Glu Asp Glu Ser Lys Leu Asp Pro Gly Ala Ala Glu Gln Val
1025 1030 1035 1040

Pro Asp Phe Val Thr Phe Leu Tyr Gln Ile Thr Arg Gly Ile Ala Ala
1045 1050 1055

Arg Ser Tyr Gly Leu Asn Val Ala Lys Leu Ala Asp Val Pro Gly Glu
1060 1065 1070

Ile Leu Lys Lys Ala Ala His Lys Ser Lys Glu Leu Glu Gly Leu Ile
1075 1080 1085

Asn Thr Lys Arg Lys Arg Leu Lys Tyr Phe Ala Lys Leu Trp Thr Met
1090 1095 1100

His Asn Ala Gln Asp Leu Gln Lys Trp Thr Glu Glu Phe Asn Met Glu
1105 1110 1115 1120

Glu Thr Gln Thr Ser Leu Leu His
1125 



<210> 29 

<211> 4374 

<212> DNA 

<213> Homo sapiens 

<400> 29 

gggcacgagc cctgccatgt ctcgccggaa gcctgcgtcg ggcggcctcg ctgcctccag 60
ctcagcccct gcgaggcaag cggttttgag ccgattcttc cagtctacgg gaagcctgaa 120
atccacctcc tcctccacag gtgcagccga ccaggtggac cctggcgctg cagcggccgc 180
agcgccccca gcgcccgcct tcccgcccca gctgccgccg cacgtagcta cagaaattga 240
cagaagaaag aagagaccat tggaaaatga tgggcctgtt aaaaagaaag taaagaaagt 300
ccaacaaaag gaaggaggaa gtgatctggg aatgtctggc aactctgagc caaagaaatg 360
tctgaggacc aggaatgttt caaagtctct ggaaaaattg aaagaattct gctgcgattc 420
tgcccttcct caaagtagag tccagacaga atctctgcag gagagatttg cagttctgcc 480
aaaatgtact gattttgatg atatcagtct tctacacgca aagaatgcag tttcttctga 540
agattcgaaa cgtcaaatta atcaaaagga cacaacactt tttgatctca gtcagtttgg 600
atcatcaaat acaagtcatg aaaatttaca gaaaactgct tccaaatcag ctaacaaacg 660
gtccaaaagc atctatacgc cgctagaatt acaatacata gaaatgaagc agcagcacaa 720
agatgcagtt ttgtgtgtgg aatgtggata taagtataga ttctttgggg aagatgcaga 780
gattgcagcc cgagagctca atatttattg ccatttagat cacaacttta tgacagcaag 840
tatacctact cacagactgt ttgttcatgt acgccgcctg gtggcaaaag gatataaggt 900
gggagttgtg aagcaaactg aaactgcagc attaaaggcc attggagaca acagaagttc 960
actcttttcc cggaaattga ctgcccttta tacaaaatct acacttattg gagaagatgt 1020
gaatccccta atcaagctgg atgatgctgt aaatgttgat gagataatga ctgatacttc 1080
taccagctat cttctgtgca tctctgaaaa taaggaaaat gttagggaca aaaaaaaggg 1140
caacattttt attggcattg tgggagtgca gcctgccaca ggcgaggttg tgtttgatag 1200
tttccaggac tctgcttctc gttcagagct agaaacccgg atgtcaagcc tgcagccagt 1260
agagctgctg cttccttcgg ccttgtccga gcaaacagag gcgctcatcc acagagccac 1320
atctgttagt gtgcaggatg acagaattcg agtcgaaagg atggataaca tttattttga 1380
atacagccat gctttccagg cagttacaga gttttatgca aaagatacag ttgacatcaa 1440
aggttctcaa attatttctg gcattgttaa cttagagaag cctgtgattt gctctttggc 1500
tgccatcata aaatacctca aagaattcaa cttggaaaag atgctctcca aacctgagaa 1560
ttttaaacag ctatcaagta aaatggaatt tatgacaatt aatggaacaa cattaaggaa 1620
tctggaaatc ctacagaatc agactgatat gaaaaccaaa ggaagtttgc tgtgggtttt 1680
agaccacact aaaacttcat ttgggagacg gaagttaaag aagtgggtga cccagccact 1740
ccttaaatta agggaaataa atgcccggct tgatgctgta tcggaagttc tccattcaga 1800
atctagtgtg tttggtcaga tagaaaatca tctacgtaaa ttgcccgaca tagagagggg 1860
actctgtagc atttatcaca aaaaatgttc tacccaagag ttcttcttga ttgtcaaaac 1920
tttatatcac ctaaagtcag aatttcaagc aataatacct gctgttaatt cccacattca 1980
gtcagacttg ctccggaccg ttattttaga aattcctgaa ctcctcagtc cagtggagca 2040
ttacttaaag atactcaatg aacaagctgc caaagttggg gataaaactg aattatttaa 2100
agacctttct gacttccctt taataaaaaa gaggaaggat gaaattcaag gtgttattga 2160
cgagatccga atgcatttgc aagaaatacg aaaaatacta aaaaatcctt ctgcacaata 2220
tgtgacagta tcaggacagg agtttatgat agaaataaag aactctgctg tatcttgtat 2280
accaactgat tgggtaaagg ttggaagcac aaaagctgtg agccgctttc actctccttt 2340
tattgtagaa aattacagac atctgaatca gctccgggag cagctagtcc ttgactgcag 2400
tgctgaatgg cttgattttc tagagaaatt cagtgaacat tatcactcct tgtgtaaagc 2460
agtgcatcac ctagcaactg ttgactgcat tttctccctg gccaaggtcg ctaagcaagg 2520
agattactgc agaccaactg tacaagaaga aagaaaaatt gtaataaaaa atggaaggca 2580
ccctgtgatt gatgtgttgc tgggagaaca ggatcaatat gtcccaaata atacagattt 2640
atcagaggac tcagagagag taatgataat taccggacca aacatgggtg gaaagagctc 2700
ctacataaaa caagttgcat tgattaccat catggctcag attggctcct atgttcctgc 2760
agaagaagcg acaattggga ttgtggatgg cattttcaca aggatgggtg ctgcagacaa 2820
tatatataaa ggacggagta catttatgga agaactgact gacacagcag aaataatcag 2880
aaaagcaaca tcacagtcct tggttatctt ggatgaacta ggaagaggga cgagcactca 2940
tgatggaatt gccattgcct atgctacact tgagtatttc atcagagatg tgaaatcctt 3000
aaccctgttt gtcacccatt atccgccagt ttgtgaacta gaaaaaaatt actcacacca 3060
ggtggggaat taccacatgg gattcttggt cagtgaggat gaaagcaaac tggatccagg 3120
cgcagcagaa caagtccctg attttgtcac cttcctttac caaataacta gaggaattgc 3180
agcaaggagt tatggattaa atgtggctaa actagcagat gttcctggag aaattttgaa 3240
gaaagcagct cacaagtcaa aagagctgga aggattaata aatacgaaaa gaaagagact 3300
caagtatttt gcaaagttat ggacgatgca taatgcacaa gacctgcaga agtggacaga 3360
ggagttcaac atggaagaaa cacagacttc tcttcttcat taaaatgaag actacatttg 3420
tgaacaaaaa atggagaatt aaaaatacca actgtacaaa ataactctcc agtaacagcc 3480
tatctttgtg tgacatgtga gcataaaatt atgaccatgg tatattccta ttggaaacag 3540
agaggttttt ctgaagacag tctttttcaa gtttctgtct tcctaacttt tctacgtata 3600
aacactcttg aatagacttc cactttgtaa ttagaaaatt ttatggacag taagtccagt 3660
aaagccttaa gtggcagaat ataattccca agcttttgga gggtgatata aaaatttact 3720
tgatattttt atttgtttca gttcagataa ttggcaactg ggtgaatctg gcaggaatct 3780
atccattgaa ctaaaataat tttattatgc aaccagttta tccaccaaga acataagaat 3840
tttttataag tagaaagaat tggccaggca tggtggctca tgcctgtaat cccagcactt 3900
tgggaggcca aggtaggcag atcacctgag gtcaggagtt caagaccagc ctggccaaca 3960
tggcaaaacc ccatctttac taaaaatata aagtacatct ctactaaaaa tacgaaaaaa 4020
ttagctgggc atggtggcgc acacctgtag tcccagctac tccggaggct gaggcaggag 4080
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gcttgggcaa cagagcaaga "aaa^cat gaaatttaaa aggttaaat. 4260 

sess .« «=, ssu t rr .., «» 

taSa"« ttaat'aata cttgctataa aaaaaaaaaa aaaaaaaaaa aaaa 



<210> 30 
<211> 20 
<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 30
gatatctcca ctgacgtaag 20



<210> 31 
<211> 19 
<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 31
tgttgccggt cttgcgatg 19



<210> 32 
<211> 21 
<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 32
cccgatctag taacatagat g 21



<210> 33 
<211> 21 
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: oligonucleotide
primer
<400> 33 

cagtctggat cgcgaaaact g 



<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 34
ggtgattacc gacgaaaacg 20



<210> 35 
<211> 20
<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 35
agtgaagggc gaacagttcc 20



<210> 36 
<211> 18 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 36 

gagtattgcc aacgaacc 
<210> 37
<211> 20
<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 37 

gtatcaccgc gtctttgatc 



<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 38
cgaaacgcag cacgatacg 19



<210> 39 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 39
gttcaacgct gacatcacc 19



<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide
primer
<400> 40 

catgttcatc tgcccagtcg 



<210> 41 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 41 

gctttggaca taccatcc 



<210> 42 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : oligonucleotide 
primer 

<400> 42 

caccgaagtt catgccag 



<210> 43 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 43 

tgactacttt tgacttcagc c 



<210> 44 
<211> 22 
<212> DNA 
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: oligonucleotide
primer

<400> 44
aaccattcaa catttttaac cc 22